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© Nanbv diagnostics and vaccines. 



© A family of cONA sequences derived from hepatitis C virus (HCV) are provided. These sequences encode 
antigens wh.ch react immunologically with antibodies present in individuals with non-A non-B hepatitis (NANBH) 
but which generally are absent from individuals infected w.ih hepatitis A virus (HAV) or hepatitis B virus (HBV)' 
and also are absent from control individuals. A comparison of these cDNA sequences w.th the sequences in 
Genebank. and w.th the sequences of hepatitis delta virus (HDV) and HBV shows a lack of substantial homology 
A comparison of the sequences of amino acids encoded in the cDNA with the sequences of Flaviviruses 

^ indicates that HCV is a Flavivirus or F!avi-like virus 

CO f 

^ The HCV cDNA sequences are useful for the design of polynudeotioe probes, and for the synthesis of 

polypeptides which may be used in immunoassays. Both the polynucleotide prooes and the polypeptides may 
00 be useful for the diagnosis of HCV-induced NANBH. and for screening blood bank specimens and donors for 
^HCV infection. In addition, these cDNA sequences may be useful for the synthesis of immunogenic polypeptides 

which may be used m vaccines for the treatment, prophylactic and/or therapeutic, of HCV infection Polypeptides 
O encoded w.thin the cDNA sequences may also be used to ra.se antibodies against HCV antigens and for the 

purification of antibodies directed against HCV. antigens. These antibodies may be useful in immunoassays for 
UJ detecting HCV antigens associated with NANBH in individuals, and in blood bank donations. Moreover these 

antibodies may be used for treatment of NANBH in individuals. 

The reagents provided -n the invention also enaoie ihe .soianon of NANBH agemts). ana the prooagaiion of 



theso aoen.is. .n ussu^ cuiiu.e sys.ems Moroovar. iney p.ov.ae -ojyon.s «„., „ , ,o, s „ 

am.v..al agents lo. HCV. pa.hculariy .n i.ssue cuiiure o. ammal mooei sysiems •' ^ ' " " sc-e«„,„g ,or 
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NANBV DIAGNOSTICS AND VACCINFS 



Tecnnical Field 



The invention relates to materials and metnodologies for managing the spread of non-A. non-B hepatitis 
virus (NANBV) .niection. More specifically, a relates to diagnostic DNA fragments, diagnostic proteins 
diagnostic antibodies and protective antigens and antibodies for an etioiogic agent of NANB hepatitis i e 
hepatitis C-ViTus. 
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Background Art 

Non.A. Non-B hepatitis (NANBH) is a tiansm.ss.ble disease or family of diseases that are believed to be 
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v.ral-.nat'ced. and mat aie a.st.ngu.shabte from omer lorms oi v.raiocuoc.aiea hver d-seases. .nciud.ng ,hai 
caused Oy the known nepant.s viruses, i.e.. hepatitis a virus (PAV). hepat.trs B virus (HBV). and delta 
hepatitis virus (HOV). as well as the hepatms induced by cytomegalovirus (CMV) or Epsiem-Bair viru- 
(EBV) NANBH was first .dentif.ed -n transfused individuals- Transmission from .nan to chimpanzee and 
5 scnal passage m chimpanzees provided evidence that NANBH is due to a transmissible infectious agent or 
agents. However, the transmissible agent responsible lor NANBH is still un.deniilied and the number of 
agents which are causative of the disease are unknown. 

Epidemiologic evidence is suggestive that there may be three types of NANBH: the water-borne 
epidemic type: -the blood or needle associated type: and the sporadically occurring (community acquired) 
w type. However, the number of agents which may be the causative of NANBH are unknown. 

Clinical diagnosis and identification of NANBH has been accomplished primarily by exclusion of other 
viral markers. Among the methods used to detect putative NANBV antigens and antibodies are agar-gel 
diffusion, counterir:.. .u..ww.^^My^.:.-'os,s. immunofluorescence micros ;opy, immune electron microscopy, 
radioimmunoassay, and enryme-linkeo immunosorbent assay. However, none of these assays has proved 
'5 to oe sufficiently sensitive, specific, and reproducible to be used as a diagnostic test for NANBH. 

Until now there has been neither cla..ty nor agreement as to the identity or specificity of the antigen 
antibody systems as:"iated with agents of NANBh. This is due. at least in part, to the prior or co-mlection 
of HBV with NANBV in individuals, and to the known complexity of the soluble and particulate antigens 
associated with HBV. as well as to the integration of HBV DNA into the genome of liver cells. In addition 
20 there is the possibility that NANBH is caused by more than one infectious agent, as well as the possibility 
that NANBH has been misdiagnosed. Moreover, it is unclear what the serological assays detect in the 
serum of patients with NANBH. It has been postulated that the agar-gel diffusion and counterimmunoeiec- 
trophoresis assavs detect autoimmune responses or non-specific protein interactions that sometimes occur 
between serum specimens, and that they do not represent specific NANBV antigen-antibody reactions. The 
?5 immunofluorescence, and enzyme-linked immunosorbent, and radioimmunoassays appear to detect low 
levels of 3 rheumatoid-facior-like material that is frequently present in the serum of patients with NANBH as 
wen :is in patients with other hepatic and nonhepatic diseases. Some or the reactivity detected may 
represent antibody to host-determined cytoplasmic antigens. 

There are a number of candidate NANBV. See. fo- example the reviews by Prince (1983), Femsicne 
30 and Hoofnagle (1984). and Overby (1985. 1986. 1987) and the article by Iwarson (1987). However, there is 
no proof that any of these candidate* represent the etiological agent of NANBH. 

The demand for sensitive, specific methods for screening and identifying carriers of NANBV and 
NANBV contaminated blood or blood products is significant. Post-transfusion hepatitis (PTH) occurs in 
aoproximately 10% of transfused patients, and NANBH accounts for up to 90% of these cases. The major 
as problem m this disease is the frequent progression to chronic liver damage (25-55%). 

Patient care as well as the prevention of transmission of NANBH by blood and blood products or by 
close personal contact require reliable diagnostic and prognostic tools to detect nucleic acids, antigens and 
antibodies related to NANBV. in addition, there is also a need for effective vaccines and immunotherapeutic 
therapeutic agents for the prevention and'or treatment of the disease. 



Disclosure of the Invention 

The invention penains to the isolation and characterization of a newly discovered etioiogic agent of 
45 NANBH. hepatitis C virus (HCV). More Specifically, the invention provides a family of cDNA replicas of 
portions of HCV genome. These cONA replicas were isolated by a technique which included a novel step of 
screening expression products from cDNA libraries created from a particulate agent m infected tissue with 
sera from patients with NANBH to detect newly synthesized antigens derived from the genome of the 
heretofore unisolated and uncharacterized viral agent, and of selecting ciones which produced products 
50 which reacted immunologically only with sera from infected individuals as compared to non-infected 
individuals. 

Studies of the nature of the genome of the HCV, utilizing probes derived from the HCV cONA. as well 
as sequence information contained witnin the HCV cONA. are suggestive that HCV is a Flavivirus or a Ravi- 
like virus. 

55 Portions Of the cONA sequences derived from HCV are useful as probes to diagnose the presence of 
virus in samples, and to isolate naturally occurring variants of the virus. These cDNAs also make available 
polypeptide sequences of HCV antigens encoded within the HCV genome(s) and permits the production of 
polypeptides which are useful as standards or reagents m diagnostic lests and/or as components of 
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vacc.r.es. Anhood-es. ooin polyclonal and monoctonni. d.ieciod aga.nsi HCV op.iopes comamed w.in.n these 
polypepLoe sequences are also useful lor d.agnosi.c tesr*;. as merapeui.c agents, lor screening ol ar^i.v.ral 
agents, ana for the .solai.on ol the NAN8V agjni from wh.ch these cDNAs dcnve. In aod.i.on Py ut.l,>,nQ 
proPes derived from these cDNAs it rs possible to isolate and sequence other port-ons ol me HCV genome 

5 thus g.v.ng rise to additional probes and polypeptides which are useful m me diagnos.s ana or treatment' 
both prophylactic and inerapeutic. of NANBH. 

Accordingly w.ih respect to polynucleotides, some aspects of the invention are. a purified Hcv 
polynucleotide: a recombinant HCV polynucleotide: a recombinant polynucleotide comprising a sequence 
derived from an HCV genome or from HCV rQNA: a recombinant polynucleotide encoding an epitope of 

'0 HCV: a recombinant vector containing the any of the above recombinant polynucleotides, and a host celt 
transformed with any of these vectors. 

^ Other aspects of the invention are: a recombinant expression system comprising an open reading frame 
(ORF) of ONA derivp^ Mrv genome or from HCV cDNA. wher*.in the ORF is operably linked to a 

control sequence compatible with a c-sired host, a cell transformed w.th the recombinant expression 
:5 system, and a polypeptide produced by the transformed cell. 

Still other aspects of the invention are- purified HCV. a preparation of polypeptides from the purified 
HCV: a purified HCV nolypeptide: a purified polypeptide cjmprismg an epitope which ,s .mmunoiog.cally 
lOentifiable with an epitope contained in HCV. 

Included aspects of the invention are a recombinant HCV polypeptide: a recomomani polypeptide 
20 comprised Of a sequence derived from an HCV genome or from HCV cONA: a recombinant polypeptide 
comprised of an HCV epitope: and a fusion polypeptide comprised of an HCV polypeptide. 

Also included in the invention are a monoclonal antibody directed against an HCV epitope: and a 
purified preparation of polyclonal antibodies directed against an HCV epitope. 

Another aspect of the invention is a particle which is immunogenic against HCV infection comprising a 
35 non-HCV polypeptide having an amino acid sequence capable of forming a particle when said sequence is 
produced in a eukaryotic host, and an HCV epitope. 

Still another aspect of the invention is a polynucleotide probe for HCV. Z 
Aspects of the invention which pertain to kits are those for: analyzing samples far the preserve of 
polynucleotides derived from HCV comprising polynucleotide probe containing ^ nucleotide sequence 
30 from HCV of about 8 or more nucleotides, in a suitable container: analyzing samples for. the presence of an 
HCV antigen comprising an antibody directed against the HCV antigen to be detected, in a suitable 
container; analyzing samples for the presence of an antibodies directed against an HCV antigen comprising 
a polypeptide containing an HCV epitope present in the HCV antigen, in a suitable container. 

Other aspects of the invention are: a polypeptide comprised of an HCV epitope, anached to a solid 
35 substrate: and an antibody to an HCV epitope, attached to a solid substrate. 

Still other aspects of the invention are: a method for producing a polypeptide containing an HCV 
epitope comprising incubating host cells transformed with an expression vector containing a sequence 
encoding a polypeptide containing an HCV epitope under conditions which allow expression of said 
polypeptide: and a polypeptide containing an HCV epitope produced by this method- 
ic The invention also includes a method for delecting HCV nucleic acids m a sample comprising reacting 
nucleic acids of the sample with a probe for an HCV polynucleotide under conditions which allow the 
formation of a polynucleotide duplex between the probe and the HCV nucleic acid from the sample: and 
delecting a polynucleotide duplex which contains the probe. 

Immunoassays are also included in the invention. These include an immunoassay for detecting an HCV ' 
^5 antigen comprising incubating a sample suspected of containing an HCV antigen with a probe antibody 
directed against the HCV antigen to be detected under conditions which allow the formation of an antigen- 
antibody complex: and detecting an antigen-antibody complex containing the probe antibody. An im- 
munoassay for detecting antibodies directed against an HCV antigen comprising incubating a sample 
suspected of containing anti-HCV antibodies with a probe polypeptide which contains an epitope of the 
50 HCV. under conditions which allow the formation of an antibody-antigen complex: and delecting the 
antibody- antigen complex containing the probe antigen. 

Also included in the invention are vaccines for treatment of HCV infection comprising an immunogenic 
peptide containing an HCV epitope, or an inactivated preparation of HCV. or an attenuated preparation of 
HCV. 

55 Another aspect of the invention is a tissue culture grown cell infected with HCV. 

Yet another aspect of the invention is a method for producing antibodies to HCV comprising administer- 
ing to an individual an isolated immunogentc polypeptide containing an HCV epitope in an amount sufficient 
to produce an immune response. 
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Sl,ll anomer aspect o( the .nvent.on ,s a method (or .so.a.mg cDNA aen.ed (,om the genome of an 
un.den..(.ea .nfect.ous agenl. compr.s. ,g: (a, prov.dihg host cells translormoa wth o.press.on Jcar^ 
con.a.n,ng a cDNA i.prary prepared from r^ucle.c acas .solaioo Irom t.ssue inlecied wth the aoen^anl 
grow,ng sa.d host cells ur.der cor,difor,s which allow express.on pi poiypepi.de(s) encoded m the cDnI 
imeracfng the e.press-on products ot the cONA w.th an ant.body conta.n.rg body component o( an 
.nd.v.duai .niected w„h sa,d .niect.ous agent under conditions which allow an .mmunoreaction. and ditecfna 

A T'" " ' 9'<'-"9 Which e priss 

poiypepfdes that lorm annbodyaniigen complexes in step (b, under condit.ons which allow the.r growm ' 
.nd|v.dual Clones and isolating sa.d clones: (d) growng cells (rom the clones ol (c) under conaaions wh." 
allBW expression 01 polypeptide(s) encoded within the cONA. and .nteract.ng the expression products wjh 
ant^ody con.a.n.ng body components of individuals other than the. .nd.vidual in step (a) who are infected 
w,th the mlecfous aoent and with control individuals uninfected with the agent, and detecting antibody- 
am.9en complexes formed as - -esult of .he interacting; (e) grow.ng host cells wh.ch express poiypepfdes 
that form an„body-ant,gen complexes w„h antibody containing body components of mlected indiv.duals and 
ct:?"!' Tl'T H '""^ "'^ components of control individuals, under 

ITTI T r : " ""^ 3"<3 <'» 'scaling the cDNA 

from the host con clones ol (e). » « v-u ^« 



J5 Sfief Oescripiion of me Drawings 

Fig. 1 shows the double-stranded nucleotide sequence ol the HCV cDNA .nsea .n clone S-i-i and 
the putative ammo acid sequence of the polypeptide encoded therein. 

91 ^ ""^ homologies of the overlapping HCV cDNA sequences m clones 5-i-i. 81. 1-2. a.id 

Fig. 3 Shows a composite sequence of HCV cDNA derived from overlapping clones 31 i-2 and 9i 
and the ammo acid sequence encoded therein. 

F.g. 4 Shows the double-stranded nucleotide sequence of the HCV cONA insert .n don^ 8i and the 
putative ammo acid sequence cf the polype»«ide encoded therein 
30 Fig. 5 Shows the HCV cONA sequence in clone 36. the segment wh.cn overlaps the nanbv cDna of 

clone 81 . and the polypeptide sequence encoded within clone 36. 

Fig. 6 shows the combined ORF of HCV cONAs in clones 36 and 8i. and the polypeptide encooec 

inQrQin . 

Fig. 7 shows the HCV cDNA sequence in clone 32. the segment which overlaps done 8i and the 
35 polypeptide encoded therein. 

Fig. a Shows the HCV cDNA sequence in done 35. iha segment which overlaps clone 36 and thP 
polypeptide encoded therein. 

Fig. 9 Shows the combined ORF of HCV cDNAs in clones 35. 36. 8i. and 32. and the polypeptide 
encoded therein. 

^ Fig. 1G shows the HCV cDNA sequence in clone 37b. the segment wh.ch overlaps clone 35 and the 

polypeptide encoded therein. 

Fig. 1 1 shows the HCV cONA sequence in done 33b. the segment wh.ch overlaps clone 32 and the 
polypeptide encoded therein. 

Fig. 12 Shows the HCV cONA sequence in done 40b. the segment wh.ch overlaps clone 37b and the 
••5 polypeptide encoded therein. 

Fig. 13 shows the HCV cDNA sequence in clone 25c. the segment which overlaps done 33b and the 
polypeptide encoded therein. 

Fig. 14 Shows the nucleotide sequence and polypeptide encoded therein of the ORF wh.ch extends 
through the HCV cDNAs in clones 40b. 37b. 35. 36. 81. 32. 33b. and 25c. 
so Fig. 15 Shows the HCV cDNA sequence in done 33c. the segment which overlaps clones 40b and 

33c. and the amino acids encoded therein. 

Rg. 16 shows the HCV cDNA sequence in clone 8h. the segment which overlaps done 33c and the 
amino acids encoded therein. 

Fig. 17 Shows 'J>e HCV cDNA sequence in done 7e. the segment which overlaps clone 8h and the 
ammo acids encoded therein. 

Fig. 18 Shows the HCV cDNA sequence m clone I4c. the segment wh.cn overlaps done 25c and the 
ammo acids encoded therein. 



Fig 19 snows the HCV cONA seouence m nnnt- SI inc 5<»nr^.»„. k , 

ovjuciiue in Clone oi. wie segnen- whicn ovenaps clone Ur a.in ik- 
ammo acids encoded iharem. "'^^ '""^ 

Fig. 20 Shows the HCV cDNA se^jence ,n clone 331. me segment wh.ch overlaps cion. si .n^ 
amino acids encoded therein. ' " 

5 Fig. 21 Shows the HCV cDNA sequence m clone 33g. the segment wh.ch overlaps clone 331 ;,„h ,h 

ammo acids encoded therein. • 

Fig. 22 Shows the HCV cDNA sequence .n done 7f. the segment wh.ch overlaps the seouenr,. o 
Clone 7e. and the amino acids encoded therein. sequence m 

Fig. 23 Shows the HCV cDNA ser^uence in clone lib. the segment wh.ch overlaps the sequence -n 
10 clone 7f. and the ammo acids encoded therein. s=Muonce m 

"T'J ^V?"' "^"^ "^'""^ •^'"^^ o-^^'aPs the sequence m 

Clone 1 1 b. and the ammo acids encoded therein. 

Fig. 25 ^h.-c ..o Hrv cDNA sequence in clone 39c. the segment wh.ch overlaps the sequence .n 
Clone 33g. ana the ammo acids encoded therein. sequence m 

" , ^Ik '''n''' ^ compos-ie Hcv cDNA sequence der.ved (rom the aligned cDNAs m clones I4i np 
7e. 8h. 33c 40b 37b 35 36. 8l. 32. 33b. 25c. Uc. 8f. 33f. 339 and 39c also shown .s the ammo Idd 
sequence of the oolypeptide encoded in the er^ondec ORF in the derived sequence 

Fig. 27 Shows the sequence ot the HCV cONA in clone i2f. the segment wh.ch overlaps clone 14, 
and the ammo acids encoded therein. 

:o Fig. 28 shows' the sequence of the HCV cONA m done 35f. the segment wn.ch overlaps clone 39c 

and the ammo acids encodeo therem. 

Fig. 29 shows the sequence of iha HCV cDNA in done i9g. the segment wh.ch overlaps clone 35f 
and the amino acids encoded therein. 

Fig. 30 shows the sequence of clone 26g. the segment which overlaps clone i9q. and tne ammo 
:5 acids encoded therein. ^ ° 

Fig. 31 Shows the sequence of clone I5e. the segment wh.ch overlaps done 26q. ana the ommo 
acids encoded therein. ^ 

Fig. 32 Shows the sequence :n a composite cDNA. wh.ch was derived by al.gnmg clones I2f througn 
i5e m the 5 to 3 direct.on: it also shows t^ 3 amino acids encodeo in the continuous ORF 
30 F.g. 33 shows a photograph of Western blots of a fusion protem. SOD-NANB. . with chimpanzee 

serum from chimpanzees infected with BB-NANB. HAV. and HBV. 

Fig, 34 shows a photograph of Western blots of a fusion protc.n. SOD-NANB« - w.ih serum from 
humans mfected with NANBV. HAV. HBV. and from control humans. 

Fig. 35 is a map showing the significant features of the vector pAB24 

^ ^^"^^^ ^""'^ sequence of the carboxy-term.nus of the fusion polypeptide 

C 100-3 and the nucleotide sequence encoding it. 

Fig. 37A is a photograph of a coomassie blue stained polyacryiamide gel wh.ch identifies CiOO-3 
expressed in yeast. 

Fig. 37B shows a Western blot of CiOO-3 with serum from a NANBV inferted human. 
Fig. 38 shows an autoradiograph of a Northern blot of RNA isolated from the liver of a 3B-NANBV 
infected chimpanzee, probed with B8-NANBV cDNA of clone 8r 

F.g. 39 shows an autoradiograph of NANBV nucleic acid treated with RNase A or DNase I and 
probed with BB-NANBV cDNA of clone 81 . ' 

Fig. 40 Shows an autoradiograph of nucleic acids extracted from NANBV panicles captured from 
infected plasma with anti-NANBs-t and probed with 22p.iabeied NANBV cDNA from clone 81. 

Fig. 41 Shows autoradiographs of filters containing isolated NANBV nucleic acids, probed with ^^P- 
labeled plus and minus strand DNA probes derived from NANBV cDNA in clone 81. 

Fig. 42 shows the homologies between a polypeptide encoded in HCV cDNA and an NS protein from 
Dengue flavivirus. 

Fig. 43 Shows a histogram of the distribution of HCV infection in random samples, as determined bv 
an ELISA screening. ' 

Fig. 44 shows a histogram of the distribution of HCV infection in random samples using two 
configurations of immunoglobulin-enzyme conjugate in an ELISA assay. 

Fig. 45 shows the sequences in a primer mix. derived from a conserved sequence in NSi of 
55 fiaviviruses. 

Fig. 46 shows the HCV cONA sequence in clone k9-i. the segments which overlaps the cDNA .n F,g. 
26. and the amino acids encoded therein. 
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F.g 47 Shows me seouence m a cc^posiie cDNA wh.cM was der.voU by ahyn.ng clones kO-i ihrourjn 
I5e in tne 5 to 3 direcnon. n aiso showa mo ammo at -as encoOftd 'n iho conimuous ORF 
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The term "hepatitis C virus" has been reserved by workers m the field lor an heretofore unknown 
- ei.ologic agent of NANBH. Accordingly, as used here- nepatitis C virus" (HCV) refers to an agent causittve 
of NANBH. which was formerly referred to as NANBV and/or BB-NANBV. The terms HCV. NANBV. and BB- 
NANBV are used interchangeably herein. Ab d-^ extension of this terminology, tne disease caused by HCV 
formerly called NANB hepatitis (NANBH). is caiifcvi hepatitis C. The terms NANBH and hepatitis C may be 
.'i used interchangeably herem. 

The term "HCV". as used herein, denotes a virai species whir*- causes NANBH. and anenuated strains 
or defective interfering particle:^ derived therefrom. As shown infra., the HCV genome is comprised o( RNA. 
It ts known mat RNA containing viruses have relatively high rates of spontaneous mutation, i.e.. reportedly 
on me order of iQ-^ to lO'* per nucleotide (Fields & Knipe. (1986)). Therefore, there are multiple strains 
20 within the HCV species described infra. The compositions and methods described herein, enatjie me 
propagation, identification, detection, and isolation of the various related strains. Moreover, they also allow 
me preparation of diagnostics and vaccines for me various strains, and have utility in screening procedures 
for anti-viral agents for pharmacologic use tn that they inhibit replication of HCV. 

The information provided herem. although derived from one strain of HCV. hereinafter referred to as 
.'5 CDC. HCV 1. .s sufficient to allow a viral taxonomist to identify other strains which fall within the species. As 
described herem. we have discovered that HCV is a Flavivirus or Flavi-like virus. The morphology and 
composition of Flavivirus panicles are known, and are discussed in Brinton (1986). Generally, with respect 
to morphology. Flaviviruses contain a central nucleoc3psid surrounded by a lipid bilayer. Virions are 
spherical and have a diameter of about 40*50 nm. Their i.orcs are about 25-30 nm in diameter. Along me 
bo outer surface of the virion envelope are projections that are about 5-i0 nm long with terminal knobs about 2 
nrri in diameter. 

HCV encodes an epitope which is immunologically identifiable with an epitope in the HCV genome from 
which the cDNAs described herein are derived: preferably me epitope is encoded in a cDNA described 
herein. The epitope is unique to HCV when compared to other known Flaviviruses. The uniqueness of the 
epitope may be determined by its immunological reactivity with HCV and lack of immunological reactivity 
with other Flavivirus species. Methods for determining immunological reactivity are known in the an. for 
example, by radioimmunoassay, by Elisa assay, by hemagglutination, and several examples of suitable 
techniques for assays are provided herein. 

in addition lo the above, the foltowmg parameters are applicable, either atone or in combination, in 
identifying a strain as HCV. Since HCV strains are evoiutionarily related, it is expected that the overall 
homology of the genomes at the nucleotide level will be about 40% or greater, preferably about 60% or 
greater, and even more preferably about 80% or greater; and in addition that there will be corresponding 
contiguous sequences of al least about 13 nucleotides. The correspondence between the putative HCV 
strain genomic sequence and the CDC/CHI HCV cONA sequence can be determined by techniques known 
J5 in the art. For example, they can be determined by a direct comparison of me sequence information of the 
polynucleotide from the putative HCV. and me HCV cDNA sequence(s) described herein. For example, also, 
they can be determined by hybridization of the polynucleotides under conditions which form stable 
duplexes between homologous regions {for example, mose which would be used prior to Si digestion), 
followed by digestion with single stranded specific nuclease(s). followed by size determination of the 
50 digested fragments. 

Because of the evolutionary relationship of the strains of HCV. putative HCV strains are identifiable by 
their homology at the polypeptide level. Generally. HCV strains are more man about 40% homologous, 
preferably more than about 60% homologous, and even more preferably more than about 80% homologous 
at the polypeptide level. The techniques for determining amino acid sequence homology are known in the 
55 art. For example, the amino acid sequence may be determined directly and compared to the sequences 
provided herein. For example also, the nucleotide sequence of the genomic material of the putative HCV 
may be determined (usually via a cDNA intermediate): the amino acid sequence encoded therein can oe 
determined, and the corresponding regions compared. 
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...r ,".T ' P"'y""C'eo..clo -oe,.v«o Irom' a des.gna.e. seqo.nco. lo, c.„,„p,.. hcv cDna 

par„cu.a,.y tnoie uxemp,.„ea ... F.gs. ,.32. or .,om an HCV genome. ,e.ers ,0 a PO-ynuCeoae seQuenc. 
wh.ch .s compr,seo 0. a sequence c. app. ,.,rT,a.ely a. -eas, abou, 6 nuceo-oes. . P-e.e.abiy a'Vat? aoo ! 
8 nuc.eo..qes. .= mo-e preferably a. leas. abou. .0-2 nuCeoi.des. and even more pro.erao.y a eas a^^ 
. -5-20 nuc:eo,.aes co.respond.ng.. . e.. homologous .0 or comp.emen.ary ,0. a reg.on 0. ha ces ' na,a". 

homologous .0 or complementary ,0 a sequence wh.ch ,s un.que .0 an HCV genom« Whe.her o 'o. ! 
sequence ,s un.que .0 Ihe HCV genome can be c1e.erm.ned by .echn.ques known .0 .hose of s..n n ne Ir, 
Fo, example, .he sequence c.n be comoared .0 sequences -n da.abanks. e.g.. Geneban.. ,0 de.Lrmfne 
.0 whether ,s-presen, .n me un.nlec.ed host or other organ.sms. The sequence can also be compareTto Z 

.h!l 1 °' "Th* correspondence or non-correspondence o. 

the derived se>.er,.« ,n n>-,r sequences can also be oe.ermined by hybr.d.zat.on under the approprta^e 

,5 ouTeTa IT'"" T'"'^^'"''" complemen,ar.,y 0. nuCe.c aaS s^ 

l.hn o «"""f H , '^-^'^ r^ybndiza..on can be determined by Known 

TZZ l ? °' = "-^'^^^ ^"^'^ « S, that speCically d.ges.s s.ng.e- 

s.randed areas .n dup.ex polynuCeot-des. Regions from wh.ch typ.cal DNA sequences may be "der-ved" 

.0 IndTonn r?'H""'"' '°' '°' '^5'°"' ^"='^'"3 « "On..ranscr.bed 

20 and^or non-translated regions. 

The der.ved po.ynucieot.de ,s not necessarily physically der.ved from the nucleotide sequence shown 
but may be generated ,n any manner, .nc.uding for example, chem.cal synthes.s or DNA repl.cat.on or" 
reverse transcnp„on or transcription, which are based on the inlormat.on provided by the sequence 0. 

..T,n'r H '^^ T^ T in addition. comb.na..ons n. reg.ons 

.5 correspond.ng ,0 that of the designated sequence may be modified in ways xnown ,n the art to be 
consistent with an intended use. . w an lo oe 

Similarly, a polypeptide or ammo aod sequence "derived from" a designated nucleic acd sequence 
for example, the sequences in Figs. ,-32. or from an HCV genome, refers .0 a polypeptide having-an am "0 

.0 ooln T'"f '° " ' ^""'^'^ ' "'^'eo' -''e^ein the 

30 portion cons.sts of at least 3-5 ar.ino acids, and more preferably at leas. 8-1O am.no acids, and even more 
preferably at least n-15 amino acids, or which is immunologically identifiable with a polypeptide encoded 

•n tne S6Qu6nce. 

A recombinant or derived polypeptide is not necessarily translated from a designated nucleic acd 
sequence, lor exanpie. the sequences in Figs. 1-32. or from an HCV genome: it may be generated in any 

-Toron fTom^red HcT'- °' ' -^-^^ - 

The term 'recombinant polynucleotide" as used herein intends a polynucleotide of genomic cONA 
semisynthetic, or synthetic origin which, by virtue of its origin or manipulation: (I) is not associated with ali 
or a port.on of the polynucleotide with which it is associated in nature or in the form of a library; and/or (2) 
40 IS linked to a polynucleotide other than that to which it is linked in nature 

The term -polynucleotide" as used herein refers to a polymerjc form of nucleotides of any length, either 
r-bonuc^otides or deoxyribonucleotldes. This term refers only' to the primary structure of the molecule 
Thus. th.s term .ncludes double- and single-stranded DNA. as well as double- and single stranded RNA It 
also .ncludes modified, for example, by methylation and/or by capping, and unmodified forms of the 
4$ polynucleotide. 

As used herein, the term "HCV containing a sequence corresponding to a cDNA" means that the HCV 
contains a polynucleotide sequence which is homologous, to or complementary to a sequence in the 
designated DNA; the degree of homology or complementarity to the cDNA will be approximately 50% or 
greater, w.ll preferably be at least about 70%. and even more preferably will be at least about 90% The 
sequences which correspond will be at least about 70 nucleotides, preferably at least about 80 nucleotides 
and even more preferably at least about 90 nucleotides in length. The correspondence between the HCV 
sequence and the cDNA can be determined by techniques known in the art. including, for example a direct 
companson of the sequenced material with the cDNAs described, or hybridization and digestion wi"th single 
strand nucleases, followed by size determination of the digested fragments. 
55 The. term ^purified of vral polynucleotide" refers to an HCV genome or fragment thereof which ,s 
essentially free. i.e.. contains less than about 50%. preferably less than about 70%. and even more 
preferably less than about 90% of polypeptides with which the viral polynucleotide ,s naturally associated 
Techniques for punfying viral polynucleotides from viral particles are known ,n the art. and .nclude for 
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Zrr!!!L'''''T°" °' ' '9°"'- °' 'f^e Po.ynucico..ae,s, ano 

OoZy chroma,09,aph ... amn.iy chroma.ograpny. and sed,men,a,.on acco!i!ng "o 

. Jn'!.."7 P^'^P^P''^^"^" 'e'ers 10 an HCV polypepi.de or (ragmen, .hereol wh.ch ,s 

5 essen,.a y ree. ..e.. con,a.ns less than aPou. 50'... preferably less than apou. /O".. and «ven mor' 
preferably less .han about 90-.. o. cellular corrponen.s w,m which ,he v.raf polypep,.de .s na.uTa 1 
assooaied. Techn.pues lor pur.ly.ng .ral polypept.des are Known .n ,ne art. and examples o7 he e 
techniques are discussed infra. h » oi inese 

,0 d«nr^?onTr'"' ^^^^ ^-Cf^ terras 

ro den(«,ng rn^roorgan-sms or h,gher eukaryot.c cell lines cultured as un.cellular entities refer ,o cells wh.ch 
can be. or have been, used as recipients lor recompinant vector or other transfer DNA. and include the 
nl?n ^^'^nstecxea. I, ,s understood that the progeny of a single 

parental ce 1 may no. necessarily completely identical ,n morpho-ogy or in genomic or total ONA 
complement as the original parent, due accidental or deliberate mutation. Progeny of the parental cell 
which are sufticientiy similar to the parent to be characterized by the relevant property such as "e 
presence of a nucleotide sequence encoding a Oes.red pe-.de. are included in the progeny intended by 
this definition, and are ;..jvered by the above terms. h y y lo oea oy 

. mil?''""." 1 * P'""^'''- ' <='^^0'"o«m9. a virus, that behaves as an 

autonomous unit of polynucleotide replication within a cell; i.e.. capable of replication under its own control 
. 'S a rephcon in which another polynucleotide segment is anached. so as to bring about the 

replicatipn and-or expression of the attached segment. 

-Control sequence- refers to polynucleotide sequences which are necessary ,o effect the expression of 
coding sequences to which they are ligated. The nature of such control sequences differs depending upon 
he host organism: .n prokaryotes. such control sequences generally include promoter, nbosomai binding 
site, and terminators: m eukaryotes. generally, such control sequences include promoters, terminators and 
.n some instances, enhancers. The term "control sequences" is intended to include, at a minimum all' 
components whose presence ,s necessary lor expression, and may also include addition.; components 
whose presence :s advantageous, for examole. Ie?der sequences. 

■Operably linked" relers to a juxt.^position where.., the components so described are .n a relationship 
permitting them tp function in their intended manner. A control sequence "operably linked" to a codinq 

^o^JI'm " ' ^''P^^"*"" °' co-^'^g sequence is achieved under conditions 

compatible with the control sequences. 

An -open reading Irame" (ORF) is a region of a polynucleotide sequence which encodes a polypeptide- 
this region may represent a portion of a coding sequence or a total coding sequence 

A "coding sequence" is a polynucleotide sequence which is transcribed into mRNA and.or translated 
■mo a polypeptide when placed under the control of appropriate regulatory sequences. The boundaries of 
the coding sequence are determined by a translation start codon at the s'-terminus and a translation stop 
codon at the 3 -terminus. A coding sequence can include, but is not limited to mRNA cDNA and 
recombinant polynucleotide sequences. 

-immunologically identifiable with/as- refers to the presence of ep.tope(s) and pofypeptides(s) which are 
also present in and are unique to the designated polypeptide(s). usually HCV proteins. Immunological 
idenuty may be detemiined by antibody binding and/or competition in binding: these techniques are known 
to those of average skill in the art. and are also illustrated infra. The uniqueness of an epitope can also be 
determined by computer searches of known data banks, e.g. Genebank. (or the polynucleotide sequences ' 
which encode the epitope, and by amino acid sequence comparisons with other known proteins 

As used herein, -epitope" refers to an antigenic determinant of a polypeptide: an epitope could 
compnse 3 am.no acids in a spatial conformation which is unique to the epitope, generatly an epitope 
consists of at least 5 such amino acids, and more usually, consists of at least 8-10 such ammo acids 
Methods of determining the spatial conformation of amino acids are known in the an and include lor 
example, x-ray crystallography and 2-dimensional nuclear magnetic resonance. 

A polypeptide is 'immunologically reactive* with an antibody when it binds to an antibody due to 
antibody recognition of a specific epitope contained within the polypeptide. Immunological reactivity may be 
determined by antibody binding, more particularly by the kinetics of antibody binding, and/or by competition 
in binding using as competitor(s) a known oolypeptide(s) containing an epitope against which the antibody 
iS directed. The techniques for determining whether a polypeptide is immunologically reactive with an 
antibody are known iri the art. 

As used herein. :he term -immunogenx polypeptide containing an HCV epitope- includes naturallv 
occurring HCV polypeptides or ;ragm«nts thereof, as well as polypeptides prepared by other means, for 
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eompie. cnem.cai syn.hes.s. or .ne e.p,ess.on o. ihe poiypep.-ae ,n a .ecomb.n3n. organ.sm 

.«nnm „,T« 'P'"'^" '° ' ^'"'"^ acas and does no> refer ,o a specW.c 

7 °' °"9opep,.oes. and pro.e,r,s are .ncluoed w„h.r, me def.n ,fon 

po-ypep -de. Th.s ,erm also does no. refer ,o pos.-express-on rT,od,t,ca..on. ol .he poiypep„de ior e ai.e 
s glycosylations. acetylat.ons. phosphorylations and the like. or example 

■Trans»orrna,.on-. as used herein, refers .0 .he .nsen.on of an exogenous polynucleotide .n,o a host ceil 
.rrespecve of the method used for ,he msert.on. for example, direct up.ake. transducon. or l^a.'ng The 

'"^^ ''^ '^^'"'^•"^^ « a non-in,egra.ed vector, for example, a plasmid o! 
alternatively, may be integrated into the host r.anome. t^asmio. or 

0 -Treaimenl" as used herein refers 10 prophylaxis anU/or therapy 

andl'nc.ultif?'"' TT' '° ^^'^^t^^'^^' P^^^-^^'^^'V -embers of the mammalian speces 

and includes but .s no. I.mi.ed to domestic animals, sports animals, primates, and humans 

Th« -^l!,!*f clri"." •'" ^ """'"^ ^^-^ ^""''es the polypeptide 

The minus sirand contains a sepuencc -vhich .s complementary .0 that of the "plus strand" 

* HMA I'*"'"' ' 9'""""*' °' ^ '5 "^'<:^ '^e genome, whether RNA or 

. single-stranded and which enc..,s a viral polypeptide(s). Examples of positive sJanded RNA 
viruses include Togav -ae. Coronaviridae. Re.rov.ri..e. Picomav.ndae. and Calicivrrdae. Included also ari 
.he Flavivridae. wh.ch were formerly classified as Togaviradae. See Fields & Knipe (1985) 

, «h>t'."t*? ''""^^ containing body component" refers .0 a component of an indiv.duaCs body 

anT^nlde h? ^"'^^o^V co"'3.ning body components are .nown in tne art 

and include but are no. limited to. tor example, plasma, serum, spinal fluid, lymph fluid the externa 
sections of the respiratory, in.es.inal. and genitourinary tracts, .ears, saliva, milk, white blood cells, and 

consmue'r^c ''T^'T!:^'' ' °' "^v which has been isolated from the cellular 

ITZT. V^ ^ T " ^"-^ "^"^ ^"'^^ <" -'^ses which may be 

present in .he infected tissue. The .echmques for isolating viruses are Known ,0 those o( sk.i. .n the an and 
^nclude. lor example, centr.lugation and affinity chromatography: 3 method of propar.ng punf,ed HCV .s 



II. Description of the Invention 



n, rJ^l ! l , the present .nven„on will employ, unless otherwise indicated, conventional techniques 
of molecular biology, microbiology, recombinant DNA. and immunology, which are w.,hin the sk.U of the an 
Such techniques are explained fully in the literature. See e.g.. f^aniatis. Fitsch & Sambrook MOLECULAR 
CLONING: A LABORATORY MANUAL ,1982): DNA CTONING. VOLUMES . AND M ,D G over ed 1 9^5. 
OUGONUCLE0T.de SYNTHESIS ,M.j. Gait ed. 1984): NUCLEIC ACD HYBR.DIZATiSn B D Han^e! & 
S.J. Higg.ns eds. 1984): TRANSCRIPTION AND TRANSLATION (B.D. Names & SJ Higg ns eds 1984V 
ANIMAL CELL CULTURE (R.l. Freshney ed. 1986): IMMOBILIZED CELLS InD ENZYMES MRL pfe« 
B^JoLC^V r^'^'^o^'- '° ^^OLECULAR CLONING ,,984): the series. METHOOs 'n 

ENZYMOLOGY (Academic Press, inc.): GENE TRANSFER VECTORS FOR MAMMALIAN CELLS (JH 
Minerand M^P^Calos eds. 1987. Cold Spring Harbor Laboratory), Methods m Enzymofogy Vol. 154 and Vol 
MPTS^rfM r^t^rr^n".^^ respectively). Mayer and Walker, eds. (1987). IMMUNOCHEMICAL 

METHODS IN CELL AND MOLECUUR BIOLOGY (Academic Press. London) Scopes (1987) PROTpin 
PURIFICATION: PRINCIPLES AND PRACTICE. Second Edition (Springer Ve lag N Y ) and HANDBOOK 
OF EXPERIMENTAL IMMUNOLOGY. VOLUMES I-IV (D.M. Weir and C. C. Blackwell eds "1986) 

All patents, patent applications, and publications mentioned herein, both supra and infra, are hereby 
incorporated herein by reference. ' 

The useful materials and processes of the present invention are made possible by the provision of a 
family of closely homologous nucleotide sequences isolated from a cDNA library derived from nucleic acid 
sequences present in the plasma of an HCV infected chimpanzee. This family of nucleotide sequences is 
not of human or chimpanzee origin, since it hybridizes to neither human nor chimpanzee genomic DNA 
from unmfected individuals, since nucleotides of this family of sequences are present only in liver and 
plasma of chimpanzees with HCV infection, and since the sequence is not present in Genebank. In addition 
the .amily of sequences shows no significant homology to sequences contained within the HBV genome 

The sequence of one member of the family, contained within done 5-1- 1. has one continuous open 
reading frame (ORF) which encodes a polypeptide of approximately 50 amino acids. Sera from HCV 
infected humans contain antibodies wh.ch bind to th.s polypeptide, whereas sera from non-inlecteo humans 
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00 no; coma.n anuDod.es lo m.s polypept.oe. F.naiiy. whereas the sera Irom un.nfecied ch.mpan^ees co noi 
contain am.bod.es to ih.s polypeptide, the o..i.Ood.es are .nduced .n chimpanzees foUDw.ng acute NANBh 
.ntect.on. Moreover, ani.bod.es to ih.s polypepi.de are not detected .n ch.m(;s and humans .nlecied w.th 
HAV and H8V. By these cnier.a the sequence is a cDNA to a v.rai sequence, where.n the v.rus causes or ,s 
assoc.ated w.th NANBH: ih.s cONA sequence is shown .n Fig. i. As discussed .ntra. the cONA sequence .n 
clone 5-1-1 differs from that of the other isolated cONAs in that it contains 28 extra base pa.rs. 

A connposite of other identified members of tho cDNA family, which were isolated us.ng as a probe a 
synthetic sequence equivalent to a frcjmer* of the cDNA in^lone 5-1-1, ,s shown m Fig, 3. A member of 
the cDNA family which was isolated using a synthetic sequence derived Uom the cDNA in clone 8i .s 
shown fn Fig. 5. and the composite of this sequence with that of done 81 .s shown in Fig. 6. Other 
members of the cONA family, including those present m clones I2f. I4i. lib. 7f. 7e 8h 33c 40b 37b 35 
36. 81. 32. 33b. -'^^ . o*^ ^3,, 33^ 39^^ 35,^ ,9^^ jSg and ise are described in Section IV A A 
composite of the cDNAs in these c:ones is described in Section IV,A. 19. and shown in Fig 32 The 
composite cDNA shows that it contains one continuous ORF. and thus encodes a polyproiem "his data .s 
;5 consistent with the suggestion, discussed mfra.. that HCV is a Flavivirus or fiavi-like virus. 

The availability this lamily of cDNAs shown :n Fig-.. 1-32, inclusive, permits the construction of ONA 
probes and polypeptides useful in diagnosing NANBH due to HCV .nfection and .n screening biood donors 
as well as oonated blood and biood products for infection. For example, from the sequences it is possible to 
synthesize DNA oligomers of about 8-1 0 nucleolioes. or larger, which are useful as hybndization probes to 
0 detect the presence of the viral genome in. for example, sera of subjects suspected of harboring the v.rus 
or for screening donated blood for the presence of the virus. The family of cDNA sequences also allows the 
design and production of HCV specific polypeptides which are useful as diagnostic reagents for the 
presence of antibodies raised during NANBH. Antibodies to purified polypeptides derived from the cDNAs 
may also be used to detect viral antigens in infected individuals ana tn blood. 

Knowledge of these cDNA sequences also enable the design and production of polypeptides which 
may be used as vaccines against HCV ana also for the production of antibodies, which .n turn may be used 
for protection against the disease, and/or for therapy of HCV infected individuals. 

Moreover, the family of cDNA sequences enables further characterization of the HCV genome. 
Polynucleotide probes derived from inesc sequences may be used to screen cDNA libraries for additional 
0 overlapping cDNA sequences, which, in turn, may be used to obtain more overlapping sequences. 'Jniess 
the genome is segmented and the segments lack common sequences, this technique may be used to gain 
the sequence of the entire genome. However, if the genome is segmented, other segments of the genome 
can oe obtained by repeating the lambda-gtli serological screening procedure used to isolate the cDNA 
clones described herein, or alternatively by isolating the genome from purified HCV particles. 

The family of cDNA sequences and the polypeptides derived from these sequences, as well as 
antibodies directed against these polypeptides arc aiso useful in the isolation and identification of the BB- 
NANBV agent(s). For example, antibodies directed against HCV epitopes contained in polypeptides derived 
from the cONAs may be used in processes based upon affinity chromatography to isolate the virus. 
Alternatively, the antibodies may be used to identify viral panicles isolated by other techniques. The viral 
antigens and the genomic material within the isolated viral particles may then be further characterized. 

The information obtained from funher sequencing of the HCV genome(s). as well as from further 
characterization of the HCV antigens and characterization of the genome enables the design and synthesis 
of additional probes and polypeptides and antibodies which may be used for diagnosis, for prevention, and 
for therapy of HCV induced NANBH. and for screening for infected blood and blood-related products. 

The availability of probes for HCV. including antigens and antibodies, and polynucleotides denved from 
the genome from which the family of cONAs is derived aiso allows for the development of tissue culture 
systems which, will be of major use in elucidating the biology of HCV. This in turn, may lead to the 
development of new treatment regimens based upon antiviral compounds which preferentially inhibit the 
replication of. or infection by HCV. 

The method used to identify and isolate the etiologic agent for NANBH is novel, and it may be 
applicable 10 the identification and/or isolation of heretofore uncharacterized agents which contain a 
genome, and which are associated with a variety of diseases, including those induced by viruses, viroids. 
bacteria, fungi and parasites. In this method, a cDNA library was created from the nucleic acids pt^eseni in 
infected tissue from an infected individual. The library was created m a vector which allowed the expression 
of polypeptides encoded in the cDNA. Clones- of host ceils containing the vector, which expressed an 
immunologically reactive fragment of a polypeptide of the etiologic agent, were selected by .mmunologicai 
screening of the expression products of the library with an antibody containing body component Irom 
another individual previously infected with the putative agent. The steps in the immunological screening 
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lechn.que .nciuded .meractmg the expression products ol ihe cONA conta.n.ny veciors w.m ihe anr.oooy 
coniain.ng body, componeni of a second .niected .ndr- .duai. and deiechng the formai.on of ani.ooUy.ant.gen 
complexes between the expression producs) and antibodies ol the second m/ected individual. The isolated 
Clones are screened funher innmunologically by interacting thoir expression products with the antibody 

5 containing body components of other individuals infected with the putative agent and with control individuals 
uninfected with the putative agent, and detecting the formation of antigen-aniibody complexes w.ih 
antibodies from the infected individuals: and the cDNA containing vectors which encode polypeptides which 
react immunologically with antibodies from infected individuals and individuals suspected of being infected 
with the agent, but not with control iMividuals are isolated. The infected individuals used for the 

JO construction of the cDNA library, and for the immunological screening need not oe of the same species. 

The cONAs isolated as a result of this method, and their expression products, and antibodies directed 
. against the expression products, are useful in characterizing and/or capturing the etiologic agent. As 
described in mo--- '-'-t. ?his method has been used successfully to isolate a family of cONAs derived 
from the HCV genome. 



H.A. Preparation o' the cDNA Sequence 



Pooled serum from a chimpanzee with chrome HCV infection and containing a high titer of the virus. 
?o I.e.. at least iO* chimp infectious doses/mi (CiD^mi) was used to isolate viral particles; nucleic acds isolated 
from these panicles was used as the template in the construction of a cDNA library to the viral genome. 
The procedures for isolation of putative HCV particles and for constructing the cONA library in -lambda-gtn 
ts discussed in Section IV.A.l. Lambda-gtli is a vector that has been developed specifically to express 
rnserted cONAs as fusion polypeptides with beta-gaiactosidase and to screen large numbers of recombinant 
5 ohage with specific antisera raised against a defined antigen. The lambda-gtl i cDNA library generated from 
a cDNA pool containing cDNA of approximate mean size of 200 base pairs -.vas screened for encoded 
epitopes that could bind specifically with sera derived from patients who had previously experienced NANB 
hepatitis. Huynn. T V, et al. (1985), Approximately 10* phages were screened, and five positive phages 
were Identifier, purified, and then tested ' r specificity of binding to sera from "different humans and 
Chimpanzees previously infected with the HCV agent. One of the phages. S-i-i. bound 5 of the 8 human 
sera tested. This binding appealed selective for sera derived from patients with prior NANB hepatitis 
»n(ections since 7 normal blood donor sera did not exhibit such binding. 

The sequence of the cDNA in recombinant phage 5-1-1 was determined, and is shown m Fig. 1. The 
polypeptide encoded by this cloned cDNA. which is in the same transiaiional frame as the N-terminai beta- 
Galactcsidase moiety of the fusion polypeptide is shown above the nucleotide sequence. This iranslational 
ORF. therefore, encodes an epitope(s) specifically recognized by sera from patients with NANB hepatitis 
infections. 

The availability of the cDNA in recombinant phage 5-1-1 has allowed for the isolation of other clones 
containing additional segments and/or alternative segments of cDNA to the viral genome. The lambda-gtl 1 
cDNA library described supra, was screened using a synthetic polynucleotide derived from the sequence of 
the cloned 5-1-1 cDNA. This screening yielded three other clones, which were identified as 8i. 1-2 and 9i; 
the cDNAs contained within these clones were sequenced. See Sections IV.A.3. and IV.A.4, The homologies 
between the four independent clones are shown in Fig. 2. where the homologies are indicated by the 
vertical lines. Sequences of nucleotides present uniquely in clones 5-1-1. 8i. and 9i are indicated by small 
letters. 

The cloned cDNAs present in recombinant phages in clones 5-i-i. 81. i-2. and 91 are highly 
homologous, and differ in only two regions. First, nucleotide number 67 in clone 1-2 is a thymidine, whereas 
the other three clones contain a cytidine residue in this position. This substitution, however, does not alter 
the nature of the encoded amino acid. 

The second difference between the clones is that clone 5-1-1 contains 28 base pairs at its 5'-terminus 
which are not present in the other clones. The extra sequence may be a s'-terminal cloning artifact: 5*- 
terminal cloning artifacts are commonly observed in the products of cDNA methods. 

Synthetic sequences derived from the s'-region and the 3'-region of the HCV cDNA in clone 8i were 
used to screen and isolate cDNAs from the lamtKla-gtl 1 NANBV cONA library, which overlapped clone 81 
cONA {Section IV.A,5.). The sequences of the resulting cDNAs. which are in done 36 and clone 32. 
respectively, are shown in Fig. 5 and Fig. 7. 

Similarly, a synthetic polynucleotide based on the 5'-region of done 36 was used to screen and isolate 
cDNAs from the lambda gt-li NANBV cDNA library which overiaoped done 35 cDNA (Section IV,A.8.). A 
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puni.ed Clone 01 recombinant pnage-conia.n.ng cDNA «n,ch hyOr.a.^ed to the synihet.c poiynudeoi.dH 
prooe was named clone 35 and the NANBV c^NA sequence contained w„h.n this done is shown .n F,n n 

By ui.l.z,ng me technique of rsolatmg overlapping cDNA sequences, clones coniammg additional 
upstream and downstream HCV cONA sequences have been obtained. The isolation o( these clones , 
described intra in Section IV.A. • ^ 

Analysis of the nucleotide sequences of the HCV cONAs encoded within the isolated clones show ,ha, 
L\T^^"^ continuous ORF. Fig. 26 shows the sequence of the composite 

CDNA from these clones, along with the putative HCV polypeptide encoded therein 

The description of the method to retrieve the cDNA sequences is mostly of historical interest The 
resultan^ sequences (and their complements) are provided herein, .-vnd the sequences, or any portion 
thereol. could be prepared using synthetic methods, or by a combination of synthetic methods with retrieval 
of partial sequences usmq methods similar to those described herein. 

Lambda-gtii strains replicated i.u n the HCV cONA library and from clones 5-M 8i 1-2 and 9i have 
been deposited under the terms of the Budapest Treaty with the American Type Culture Collection (ATCC) 
12301 Parkiawn Dr.. Rockville. Marylano 20852. and have been assigned the following Accession Numbers ' 



iambda-gti i 


ATCC 
No. 


Deposit Date 


HCV cDNA library 


40394 


1 Dec. 1987 


done 81 


40388 


1 7 Nov. 1 987 


clone 91 


40389 


17 Nov. 1987 


done 1-2 


40390 


17 Nov. 1987 


done 5-1-1 


40391 


18 Nov. 1987 



The designated deposits win be maintained (or a penod o( thirty (30) years from the date ol deposit or 
'or five (5) years after the last request lor the deposit; or lor the enforceable life of the U S patent 
whichever IS longer. These deposits and othc. cfAposited materials mentioned herein are intended for 
convenience only, and are not required to practice the present invention in view of the description here. The 
HCV cDNA sequences in all of the deposited materials are incorporated herein by reference 

The description above, of "walking" the genome by isolating overlapping cONA sequences from the 
HCV lambda gt-n library provides one method by which cDNAs corresponding to the entire HCV genome 
may be isolated. However, given the irjformation provided herem. other methods for isolating these cDNAs 
are Obvious to one of skill in the art. Some of these methods are described in Section IV.A.. infra. 



II B- Preparation of Viral Polypeptides and Fi-agments 

The availabifity of cDNA sequences, either those isolated by utilizing the cONA sequences m Figs 1-32 
as discussed infra, as well as the cDNA sequences in these figures, permits the construction of expression 
vectors encoding antigenicsfly active regions of the polypeptide encoded in either strand. These antigeni- 
cally active regions may be derived from coat or envelope antigens or from core antigens, including. »or 
example, polynucleotide binding proteins, polynucleotide polymerase(s). and other viral proteins required for 
the replication and/or assembly of the virus particle. Fragments encoding the desired polypeptides are 
derived from the cDNA clones using conventional restriction digestion or by synthetic methods, and are 
ligated into vectors which may. for example, contain portions of fusion sequences such as beta-GoJ^c- 
tosidase or superijxide dismutase (SOD), preferably SOD. fviethods and vectors which are useful for the 
production of polypeptides which contain fusion sequences, of SOD are described in European Patent Office 
Publication number 0196056. published October 1. 1986. Vectors encoding fusion polypeptides of SOD and 
HCV polypeptides, i.e.. NANBs-.-,. NANB,,. and ClOO-3. which is encoded in a composite of HCV 
cDNAs. are described in Sections IV.B.I. IV.B.2. and IV.B.4. respectively. Any desired portion of the HCV 
cDNA containing an open reading frame, in either sense strand, can be obtained as a recombinant 
polypeptide, such as a maturs or fusion protein: alternatively, a polypeptide encoded in the cDNA can be 
provided by chemical synthesis. 

The DNA encoding the desired polypeptide, whether in fused or mature form, and whether or not 
containing a signal sequence to permit secretion, may be ligated into expression vectors suitable for any 
convenient host. Both eukaryotic and prokaryotic host systems are presently used in forming recombinant 
polypeptides, and a summary of some of the mere common control systems and host cell lines is given in 
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Sc^cuon iil.A. .ntfa Tne poiypapuoo .s men .soiaied from lysed cetis or f.om me culture rnedium ana 
purified 10 ihe exient needed lor us intended use. PvnUcauon may be by techniques known ,n the an. lor 
example, salt Iractionatton. chromatograoh- on .on exchange resins, allimiy chromatography, cenirifugation 
and the tike. See. lOf example. Methods m En>ymology lor a variety of methods lor punly.ng proteins. Such 
5 polypeptides can be used as diagnostics, or those which give rise to neutrali/.ng annbodiec may be 
lormulated mto vaccines. Antibodies raised against these polypeptides can atso be used as diagnostics or 
lOf passive immunotherapy. In addition, as discussed in Section herein below, antibodies to these 
polypeptides are uselu) for isolating and identifying HCV particles. 

The HCV antigens may also be isolated from HCV virions. The virions may be grown in HCV infected 

0 cells in tissue' Culture, or in an infected host.. 

" C. Preparation r\^*.r,an,r Polyp eptides and Conjugation with Carrier 

5 An antigenic region of a polypepttoe is generally relatively smail-typicaily 8 to lO ammo adds or less 
in lengtr.. Fragments of as few as 5 ammo acids may characterize an antigenic region. These segments 
may correspond to regions of HCV antigen. A'^cordii ^^ly. using the cDNAs of HCV as a basis. DNAS 
encoding short segments of HCV polypeptides can be expressed recombinantly either as fusion proteins, or 
as isolated polypeptides. In addition, short amino acid sequences can be conveniently obtained by chemical 

) synthesis. In instances wherein the synthesized polypeptide is correctly configured so as to provide the 
correct epitope, but is too small to be immunogenic, the polypeptide may be linked to a suitable car-ier. 

A number of techniques for obtaining such linkage are known in the art. including the formation of 
disulfide linkages using N-succinimidyl-3-(2-pyridylthio)propionate (SPDP) and succimmidyl 4.(N-mal- 
eimidomethyi)cyclohexane-l-carboxylate (SI^CC) obtained from Pierce Company. Rockfo'^. Illinois, (if the 

1 peptide lacks a sulfhydryl group, this can be provided by addition of a cysteine residue.) These reagents 
create a disulfide linkage between themselves and peptide cysteine residues on one protein and an amide 
linkage through the epsilon-amino un a iysme. or other free amino group m the other. A variety of such 
disulfide- amide-forming agents are known. See. for example. Immun. Rev. (1982) 62:185. Other biluncttonai 
coupling agents form a ihioeiher rather than ; disulfide linkage, fvlany of these thio'^tBer. forming agents are 
commercially available and inclucd reactive esters of 6-maleimidocaproic acid. 2-bfomoaceiiC' acid. 2- 
lOdoacetic acid. 4-(N-maleimidomethyl)cyclohexane-i-carboxylic acid, and the like. The carboxyl groups can 
be activated Dy combining them with succinimide or i-hydroxyl-2-nitro-4-suifonic acid, sodium salt. The 
foregoing list is not meant to be exhaustive, and modifications of the named compounds can clearly be 
used. 

Any carrier may be used which does not itself induce the production of antibodies harmful to the host. 
Suitable carriers are typically large, slowly metabolized macromolecules such as proteins: polysaccharides, 
such as latex functionalized sepharose. agarose, cellulose, cellulose beads and the like: polymeric amino 
acids, such as poiyglutamic acid, polylysine. and the like: amino acid copolymers: and inactive virus 
particles." see. for example, section II.D. Especially useful protein substrates are serum albumins, keyhole 
limpet hemocyanin. immunoglobulin molecules, thyroglobulin, ovalbumin, tetanus toxoid, and other proteins 
well Jtnown to those skilled in the art. 



'I D- Preparation of Hybnd Particle Immunogens Containing HCV Epitopes 



The immunogenicity of the epitopes of HCV may also be enhanced by prepanng them in mammalian or 
yeast systems fused with or assembled with partide-forming proteins such as. for example, that associated 
with hepatitis B surface antigen. Constructs wherein the NANBV eoitope is linked directly to the particle- 
forming protein coding sequer^ces produce hybrids which are immunogenic with respect to the HCV 
epitope. In addition, all of the vectors prepared include epitopes specific to HBV, having various degrees of 
immunogenicity. such as. for example, the pre-S peptide. Thus, particles constructed from particle forming 
protein which include HCV sequences are i.mmunogenic with respect to HCV and HBV. 

Hepatitis surface antigen (HBSAg) has been shown to be formed and assembled into particles in S. 
cerevisiae (Vaienzueia ei ai. (1982)). as well as in. for example, mammalian cells (Valenzueia. P.. et aT 
(1984)). The formation of such particles has been shown to enhance the immunogenicity of the monomer 
subunii. The constructs may also include the immunodominant epitope of HBSAg, comprising the 55 amino 
acids of the presurface (pre-S) region. Neurath et al. (1984). Constructs of the pre- S-HBSAg particle 
expressible in yeast are disclosed in EPO 174.444. published March 19. 1986: hybrids including heterolo- 
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go..s vrai seouences lo- yeast exp-ession are .^.sciose.i ,n EPO 175.26 J pubi.snea Maicn 26 i96G Bom 
aDDi.C3i.ons are aSs.gnea 10 me he,e,n -ssignec. and aro .nco-Doraiecl here.n Oy reiercce The.e 
consuucis may also oe expressed m mammai.an cells such as Cr^.nese namsier ovary iCHO) cells uc.no ,n 
bvco-d.nydroloiaie reouctase vector (Michelle et ai (i984)) ' ^ 

in addiLon. portion: of the particie-formmg orotem coding sequence may be replaced w„h codon^ 
encoding an HCV epitope In tMi .^placement, regions which are not required to rr^ed.aie the aggregation 
the units to form immunogenic particles m yeast or mammals can pe deleted, thus eliminating additional 
HBV antigenic sues (rom competition with the HCV epitope. 



'0 



.E. Preparation ol Vaccines 



Vaccines may be prepared Iroir one or more immunogenic polypeptides derived from HCV cONA as 
well as Irom me cDNA sequences m u.e Figs. 1-32, or from the HCV genome to wmch they correspond 
^5 The oDserved homology between HCV and Fiav.viruses provides miormation concerning the polypeptides 
which are likely to be most effective as vaccines as w, ; as the regions of the genome m winch they are 
encoded. The general structure of the Flaviv.rus genome is discussed m Rice et al (1986) The fiaviv.rus 
genomic RNA ,s believed to be the only v.rus-specific mRNA spec.es. and u is translated mto the three viral 
structural proteins, i.e.. C. fvl. and E. as well a: two large nonstructural proteins. NV4 and NV5 ana a 
.'0 comolex set of smaller nonstructural proteins. It is known that major neutralizing epitopes lor Flaviv.ruses 
reside in the E (envelope) protein (Roehrig (1986)). The corresponding HCV E gene and polypeptide 
encoding reg.on can be predicted, based upon mt homology to Fiav.viruses. Thus, vaccines may be 
comprised of recombinant oolypeptides containing epitopes ol HCV E. These polypeptides may be 
expressed m bactena. yeast, or mammalian cells, or alternatively may be isolated from virai preparations 11 
IS also anticipated that the other structural prote.ns may also contain epitopes wn.ch give nse to protective 
anti-HCV antibodies. Thus, polypeptides containing the epitopes of E. C. and M may also be used, wnether 
Singly or in combination, in HCV vaccines. ? 

In addition to the above, it has been show< that immunization wuh NSi (nonstructural prote.n t)" results 
-n protection against yellow lever (Sciesinger et al (1986)). This -s true even though the immunizat-on does 
30 not give nse to neutralizing antibodies. Thus, particularly since this prote.n appears to oe highly conserveo 
among Flaviviruses. it is likely that HCV NSi will also be protective against HCV infection, ivioreover n also 
Shows that nonstructural prote.ns may provide protection against viral pathogenicity, even if thoy do not 
cause the production of neutralizing antibodies. 

In view of the above, multivalent vaccines against HCV may be comprised 01 one or more structural 
3S proteins, and'or one or more nonstructural proteins. These vaccines may be comprised of. for example 
recombinant HCV polypeptides and/or polypeptides isolated from the virions. :n addition, it may be possible 
to use inactivated HCV in vaccines: inactivation may be by the preparation of viral lysates. or by other 
means known in the art to cause inactivation of Flaviviruses. for example, treatment with organic solvents or 
detergents, or treatment with formalin. (Moreover, vaccmes may also be prepared Irom attenuated HCV 
40 strains. The preparation of anenuated HCV strains is described infra. 

It is known that some of the proteins in Raviviruses contain highly conserved regions, thus, some 
immunological cross-reactivity is expected between HCV and other Flaviviruses. It is possible that shared 
epitopes between the Flaviviruses and HCV wHi give rise to protective antibodies against one or more of the 
disorders caused by these pathogenic agents. Thus, it may be possible to design multipurpose vaccines 
4S based upon this knowledge. 

The preparation of vaccines which contain an immunogenic polypeptide(s) as active ingredients is 
known to one skilled in the an. Typically, such vaccines are prepared as injectabies. either as liquid 
solutions or suspensions: solid foms suitable for solution in. or suspension in. liquid prior to injection may 
also be prepared. The preparation may also be emulsified, or the protein encapsulated in liposomes. The 
active immunogenic ingredients are often mixed with excipients which are pharmaceutically acceptable and 
compatible with the active ingredient. Suitab'e excipients are. for example, water, saline, dextrose, glycerol, 
ethanol. or me like and combinations thereof. In addition, if desired, the vaccine may contain minor amounts 
of auxiliary substances such as wetting or emulsifying agents. pH buffering agents, and/or adjuvants which 
enhance the eHectiveness of the vaccine. Examples of adjuvants which may be effective include but are not 
limited to: aluminum , hydroxide. N-acetyl-muramyl-L-threonyi-o-isoglutamme m-MDP). N-acetyl-nor- 
moramyi-L-alanyl-D-fSoglutamme (CGP 11 637. referred to as nor-i^DP). N-acetylmuramyi-L-alanyl-O- 
isogiutaminy|.L-alanine-2-(i -2 <l'Paimitoyl-sn-giycero-3-hydroxyphosohoryioxv)-ethyiamine (CGP 19835A 
referred 10 as fvlTP-PE). and Rl3l. which contains three components extracted from oacteria. mon^ 
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ophospnoryi t.u.o A. u^haiose o.inycuiaie ano cell wan skeieron (MPl * 70.. * CWS. -r. 
squalone Tween 80 emuis.on The ellect.veness ol an adjuvant may be aeierm.neo Dy measuring mo 
amouni ol ani.bod.es O.recied aga.nst an „ -munogen.c polypepi.oe conia.n.ng an hCv am.Qen.c sgqu&ocl- 
resuliing from adm.n.strai.on ol ih.s poiypepi.de -n vacanes wn.cn are also compr.sed ol me va,.ouc 
adjuvants. " 

The vaccines are conventionally aam.n.sierud parenteraliy. py injection, lor example e.iher suo- 
cutaneously or .niramusculariy. Additional formulations which are suitapie lor other modes of aomin,sirat.on 
include suppositories and. m some cases, oral formulations. For suppositories, traditional binders and 
earners may induoe, for example, polyalkyiene glycols or inglycei.des; such suppositories may be formed 
from mixtures containing the active ingredient in the range ol 05% to 10%. preferably t%.2% Oral 
formulaTOns include such normally employed exc.pients as. for example, pharmaceutical grades of 
. mann.tol. lactose, starch, magnesium stearate. sodium saccharine, cellulose, magnesium carbonate and the 
like. These compns.tir^ns i^k^ the form ol solutions, suspensions, tablets, pills, capsules, sustained roiease 
formulations or powders and coma..; i0%-95% of active ingredient, preferably 25%-70%. 

The proteins may be formulated i..:o the vaccine as neutral or salt lorms. Pharmaceui.caliy acceptable 
salts include the acid addition salts (formed with free amino groups of the pepi.de) and which are formea 
wuh .norgan.c acds such as. for example, hydrochloric a phosphoric aods. or sucn organ.c acids such as 
acetic, oxalic, tartaric, maie.c. and the like. Salts formed w,th the free carboxyi groups may also be derived 
from inorganic bases such as. for example, sodium, potassium, ammonium, calcium, or feme hyorox.des 
and such organ.c bases as .sopropylamme. tnrrothyiamme. 2-othyiamino ethanol. hist.d.ne. procaine ana 
the like. 



II P- Dosage and Admintstfat.on of Vaccines 

25 

The vaccines are administered .n a manner compatible w.th me dosage formulation, and .n such 
amount as w.if be prophyiacticaily and'or therapeutically effective. The quantity to be aam.msrered. which .s 
generally m the range of 5 micrograms to 250 micrograms of antigen per dose, depends on the subject to 
be treated, capacity of the subject's immune -.ystem to synthesize antibodies, and the degree of protection 
30 desired. Precise amounts of active ingredient required to be administered may depend on the judgment of 
the practitioner and may be peculiar to each subject. 

The vaccine may be given in a single dose schedule, or preferably in a multiple dose schedule. A 
multiple dose schedule is one in which a primary course of vaccination may be with i-iQ separate doses, 
followed by other doses given at subsequent time intervals required to maintain and or reenfo'ce the 
35 immune response, for example, at 1-4 months for a second dose, and if needed, a subsequent dose(s) after 
several months. The dosage regimen will also, at least m pan. be determined by the need of the individual 
and be dependent upon the judgment of the practitioner. 

In addition, the vaccme containing the immunogenic HCV antigen(s) may be administered in conjunction 
with other immunoreguiatory agents, for example, immune globulins. 

ITG. Preparation of Antibodies Against HCV Epitopes 

The immunogenic polypeptides prepared as described above are used to produce antibodies, octh 
45 polyclonal and monoclonal. If polyclonal antibodies are desired, a selected mammal (e.g.. mouse, rabbit, 
goat, horse, eic.) is immunized with an immunogenic polypeptide bearing an HCV epitcpe(s)- Serum from 
the immunized animal is collected and treated according to known procedures. If serum containing 
polyclonal antibodies to an HCV epitope contains antibodies to other antigens, the polyclonal antibodies can 
be purified by immunoaffinity chromatography. Techniques for producing and processing polyclonal 
50 antisera are known in the art. see for example. Mayer and Walker (1987). 

Alternatively, polyclonal antibodies may be isolated from a mammal which has been previously infected 
with HCV. An example of a method for purifying antibodies to HCV epitopes from serum from an infected 
individual, based upon affinity chromatography and utilizing a fusion polypeptide of SCO and a polypeptide 
encoded within cONA clone 5-1-1. is presented in Section V.E. 
55 Monoclonal antibodies directed against HCV epitopes can also be readily produced by one skilled in 
the an. The general methodology for making monoclonal antibodies by hybridomas is well known. Immonai 
antibody-producing cen lines can be created by eel! fusion, and also by other techniques such as direct 
transformation of 3 lymphocytes with bnccgemc DNA. or transfection with Epstem-Barr virus. See. e.o . M. 
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Scnidiei ei dt (1980). HcimmeilnHj ei 31 (»y8u. Kunneti ei al. dOUOi see. aiso. U S Paioni Noi -J 3-»i Tbl 
4.399.121. 4.427.783: 4.444 887; 4.466.917: 4.- ro.soO: 4.491.632. af~4l93.890 Panels ormonocionoi 
aniiDodies produceo agamsi HCV epitopes can be scfeened for vanous propenies: 1 0 . (or isoiypH. epuone 
atfifiiiy. eic 

: Anitbodtes. bom monnconai and polyclonal, which are directed against HCV epitopes are parncuioriy 
useful in diagnosis, and those which are neutraiumg are useful m passive immunotheraoy Monociond 
anttbc'*ies. m particular. nr»ay be used 10 raise anii-idiotype antibodies. 

Anil* itoiype antibodies are immunoglobulins which carry an "mtoinal image" of the antigen of the 
infectious ^'^erw against which protection is oesired. See. for example. Nisonoff. A., el al. (I98i) ano 
w Dreesman et 1I. (1985). 

Techniques for raising anii-idiotype antibodies are known m the an. See. for example. Grzych (1985). 
MacNamara et a (i984). and Uytdehaag et al. (1985). These anti-idioiype antibodies may also be useful for 
treatment of NAN8H. as well as (or an elucidation of the immunogenic regions of HCV antigens 

II. H. Diagnostic Oitgonudeottde Probes and Kits 



Using the disclosed portions of the isolated HCV cDNAs as a basis, including those m Figs. 1-32. 
oligomers of approximately 8 nucleotides or more can be prepared, either by excision or synthetically. 

:o which hyondize with the HCV genome and are useful m identification of the viral ageni(s). (unher 
Characterization of the viral genome(s). as well as in detection of the virus(es) in diseased individuals. The 
probes for HCV polynucleotides (natural or derived) are a length which allows the detection of unique viral 
sequences by hybridization. While 6-8 nucleotides may be a workable length, sequences of 10-12 
nucleotides are preferred, and about 20 nucleotides appears optimal. Preferably, these sequences wilt 

rs derive from regions which lack heterogeneity. These probes can be prepared using routine methods, 
including automated oligonucleotide synthetic methods. Among useful probes, for example, are the clone 5- 
1-1 ano the additional clones disclosed herein, as well as the various oligomers useful in probing cDNA 
libraries, set forth below. A complement to any unique ponion of the HCV genome will be satisfactory..^ For 
use as probes, complete complementarity i;, desirable, though it may be unnecessary as the tenc^lh of the 

30 fragment is increased. 

For use of such probes as diagnostics, the biological sample to be analyzed, such as blood or serum, is 
treated, if desired, to extract the nucleic acids contained therein. The resulting nucleic acid from the sample 
may be subjected to gel electrophoresis or other size separation techniques: alternatively, the nucleic acid 
sample may be dot bloned without size separation, the probes are then labeled. Suitable labels, and 

35 methods for labeling probes are known in the art. and include, for example, radioactive labels incorporated 
by nick translation or kinasing. biotin. fluorescent probes, and chemiluminescent probes. The nucleic acids 
extracted from the sample are then treated with the labeled probe under hybridization conditions of suitable 
stringencies. 

The probes can be made completely complementary to the HCV genome. Therefore, usually high 

^0 stnngency conditions are desirable in order to prevent false positives. However, conditions of high 
stringency should only be used if the probes are complementary to regions of the viral genome which lack 
heterogeneity. The stringency ot hybridization is determined by a number of factors during hybridization 
and during the washir^i procedure, including temperature, ionic strength, length of time, and concentration 
of formamide. These factors are outlined in. for example. Maniatis, T. (1982). 

-»5 Generally, it is expected that the HCV genome sequences will be present in serum of infected 
individuals at relatively low levels, i.e.. at approximately lO^-iO^ sequences per ml. This level may require 
that amplification techniques be used in hybridization assays. Such techniques are known in the art. For 
example, the Enzo Biochemical Corporation "Bio-Bridge" system uses terminal deoxynudeotide transferase 
to add unmodified 3 -poly-dT-tails to a DNA probe. The poly dT-tailed probe is hybridized to the target 

50 nucleotide sequence, and then to a biotin-modified poly-A. PCT application 84/03520 and EPA124221 
describe a DNA hybridization assay in which: (i) analyte is annealed to a single-stranded DNA probe that is 
complementary to an enzyme-labeled oligonucleotide; and (2) the resulting tailed duplex is hybridized to an 
enzyme-labeled oligonucleotide. EPA 204510 describes a DNA hybridization assay in which analyte DNA is 
contacted with a prot>e that has a tail, such as a poly-dT tail, an amplifier strand that has a sequence that 

55 hybridizes 10 the tail of the probe, such as a poly-A sequence, and which is capable of binding a plurality of 
labeleo strands. A particularly aesirable technique may first involve amplificatjon of the target HCv 
sequences m sera approximately 10.000 fold, i.e., to approximately lO^ sequences/ml. This may oe 
accomplished, for example, by the technique of Saiki et al. (1986). The amplified secuenceis) may then be 
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NO. 2300-017, wn.ch was Med i5 Oc.ooer ,987. .s ass.cjnea ,o ,he hcro.n ass.gnee. and !s ^1:1 
-ncorporaied he.o.n by reference This hybr.. ;3„on assay, wh.ch sfiould deieci sequer^ces a. ihe level ol 
10- ml unlives nucie.c aca muil.mers wh.ch b.nd lo s.ngle-si.anaed anaiyie nude.c acd and wh.ch also 
bind 10 a muli.pl.c.ty ol singie-s.randod labeled oi.gonucieot.oes. A suitable solution phase sandwich assa 
which may bo used with labeled polynucleotide probes, and Ihe methods lor the preparation ol probes il 
descr.bej ,n EPO 225.807. published Ju.^e ,6. 1987. wh.ch .s ass.gned ,o the herein ass.gM„«. and wmch ,1 
hereby mcorporaied herein by rolerence. 

The probes, can be packaged into diagnostic k.is. Diagnostic kits include the probe DNA. which may be 
labeled: alternatively, the probe ONA may be unlabeled and the ingredients lo- labeling may be included m 
the kit. The kit may also contain other suitably packaged reagents and materials needed (or the particular 
hybridization protocol, lor example, standards, as well as mstruciions lor conducting the lest 



■5 lU. Immunoassay and Diagnostic Kns 

Both me polypeptides which react immunologically wr , serum containing mCV antibod.es (or example 
those derived (rom or encoded within the clones described m Section IV.A.. and composites thereof (see 
section tV.A.) and the antibodies raised against the HCV specific epitopes m these polypeptides see lor 
20 example Section IV.E. are useful m immunoLisays to detect presence o( HCV ant.bod.es. or the preser-e 
o( the virus and/or viral antigens, in biological samples, including (or example, blood or serum samples 
Design o» the immunoassays is subject to a great deal of variation, and a variety of these are known in the 
art. For example, ihe immunoassay may utilize one viral antigen, for example, a polypeptide derived from 
any of the clones containing HCV cDNA described in Section IV.A.. or from the composite cDNAs derived 
?5 from the cONAs in these clones, or from the HCV genome from which the cDNA in these ctones is derived- 
alternatively, the immunoassay may use a combination of viral antigens derived from these sources. It may 
use. (or example, a monoclonal antibody directed towards a viral epitope(s). a combination of monoclonal 
antibodies directed towards one viral antigen, monoclonal antibodies directed towards different virai 
antigens, polyclonal antibodies directed toward- the same viral antigen, or polyclonal antibodies directed 
30 towards diHerent viral antigens. Protocols may be based, (or example, upon competition, or direct reaction 
oc sandwich type assays. Protocols may also, for example, use solid supports, or may be by .m- 
munoprecpitation. Most assays involve the use of labeled antibody or polypeptide: the labels may be for 
example, fluorescent, chemiluminescent. radioactive, or dye molecules. Assays which amplify the signals 
from the probe are afso known; examples of which are assays which utilize biotm and avidin. and enzyme- 
35 labeled and mediated immunoassays, such as ELISA assays. 

The Flavivirus model for HCV allows predictions regarding the likely location of diagnostic epitopes (or 
the vinon structural proteins. The C. pre-iwi. M. and E domains are all likely to contain epitopes of significant 
potential for detecting viral antigens, and particularly foV diagnosis. Similarly, domairs of the nonstructural 
proteins are expected to contain important diagnostic epitopes (e.g., NS5 encoding a putative polymerase: 
40 and NSl encoding a putative complement-binding antigen). Recombinant polypeptides, or viral polypep- 
tides, which include epitopes (rom these specific domains may be useful (or the detection of viral antibodies 
m infections blood donors and infected patients. 

In addition, antibodies direaed against the E and/or M proteins can be used in immunoassays (or the 
detection o( viral antigens in patients with HCV caused NANBH. and in in(ectious blood donors, (wloreover 
<5 these antibodies will be extremely useful in detecting acute-phase donors and patients. 

Kits suitable (or immonodiagnosis and containing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the polypeptides o( the invention containing HCV epitopes or 
antibodies directed against HCV epitopes in suitable containers, along with the remaining reagents and 
rr.atsr:a!s required for the conduct ?! the assay. =s -.vs:! as a suitable set ot assay instructions. 



lU. Purther Characterization of the HCV Genome. Virions, and Viral Antigens Using Probes Derived From 
cONA to the Viral Genome 

The HCV cDNA sequence information in the clones described in Section IV.A.. as shown .n Figs. 1-32. 
inclusive, may be used to gain further information on the sequence of the HCV genome, and for 
Identification and isolation of the HCV agent, and thus will aid in its characterization including the nature of 
the genome, the structure of the viral pamcle. and the nature of the antigens of w,icn it is composed. This 
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.nlorrnahon. .n turn, can lead lo adC.nonal poiynuCleoi.de oroOes. poiypepi.Oes der.ved (rom ihe HCV 
genome, and ani.boO.es directed aga.nsi HCV ep.iope5> wmcn v^ould be use»ul ror the d.agnos.s and or 
ueaimeni o( HCV caused NANBH. 

The cDNA sequence .ntormai.on ,n the above-moni.oned clones .s use/ul tor the des.gn ot probes for 
5 the isolation ot adoit.onal cDNA sequences wn.ch are der.ved from as yet unoefined regions o( the HCv 
genome(s) frorr^ which the cDNAs in clones described m Section IV.A. are derived. For example. labeled 
probes containing a sequence ol approximately 8 or more nucleotides, and preferably 20 or more 
nucleotides, which are derived from reo.'.ns close to the s'-termin. or a'-iermm. of the family of HCV cDNA 
sequences shown in Figs, l, 3. 6. 9. 14 and -.2 may be used to isolate overlapping cONA sequences from 
ro HCV cONA libraries. These sequences which overlap the cDNAs m the above-mentioned clones, but which 
also contain sequences derived from regions of the genome from which the cONA in the above mentioned 
clones are not derived, may then be used to synthesize probes for identification of other overlapping 
fragments wh.ch do not necessanly overlap the cDNAs in the clones cescnbed in Section IV.A. Unless the 
HCV genome .s segmented and the segments lack common sequences, it .s possible to sequence Ihe 
5 entire v.ral genome(s) utilizing the technique of isolation of overlapping cDNAs derived from the viral 
genome(s). Although it is unlikely, if the genome is a segmented genome wh.ch lacks common sequences, 
the sequence of thb genome can be determineo by serologically screening lambda-gtil HCV cDNA 
libraries, as used to isolate clone 5-M. sequencing cDNA isolates, and using the isolated cDNAs to isolate 
overlapping fragments, using the technique described for the isolation and sequencing of the clones 
0 described m Section IV.A. Alternatively, characterization of the genomic segments could be from ihe viral 
genome(s) isolated from purified HCV panicles. Methods for purifying HCV particles and for detecting them 
during the purification procedure are described herein, infra. Procedures for isolating polynucleotide 
genomes from vira! particles are known in the art. and one procedure which may be used .s shown ir 
Example IV.A.l. The isolated genomic segments could then be cloned and sequenced. Thus, with the 
5 information provided herein, it is oossible to done and seo'.ence the HCV genome(s) irrespective of their 
nature. 

Methods for constructing cDNA libraries are known in the an. and are discussed supra andHnfra- a 
method for the construction of HCV cDNA libranes in lambda-gili is discussed ififra in Section IV.A 
However. cDNA libraries which are useful *or sc-:iening with nucle.c acid probes may also be constructed in 
0 other vectors known in the an. for example. lambda-gtiO (Huynh et al. (1985)). The HCV derived cONA 
detected by the probes derived from the cONAs in Figs. i-32. and from the probes synthesized from 
polynucleotides derived from these cONAs. may be isolated from the clone by digestion of the isolated 
polynucleotide with the appropriate restriction enzyme(s). and sequenced. See. for example. Section IV.A.3. 
and IV.A.4. for the techniques used for the isolation and sequencing of HCV cDNA which overlaps HCV 
cDNA in clone 5-1-1. Sections IV.A.5-IV.A.7 for the isolation and sequencing of HCV cONA which overlaps 
that in clone 81, and Section IV.A.8 and IV.A.9 for the isolation and sequencing of a clone which overlaps 
another clone (clone 36). which overlaps clone 81. 

The sequence information derived from these overlapping HCV cDNAs is useful for determining areas 
of homology and heterogeneity within the viral genome(s). which could indicate the presence of different 
strains of the genome, and/br of populations of defective particles. It is also useful for the design of 
hybridization probes to detect HCV or HCV antigens or HCV nucleic acids in biological samples, and during 
the isolation of HCV (discussed infra), utilizing the techniques described in Section M.G. Moreover, the 
overlapping cONAs may be used to create expression vectors for polypeptides derived from the HCV 
genome(s) which aJso encode the polypeptides encoded in clones 5-1-1. 36. 8 1. 91. and 1-2. and in the 
other clones described in Section IV.A. The techniques for the creation of these polypeptides containing 
HCV epitopes, and for antibodies directed against HCV epitopes contained within them, as well as their 
uses, are analogous to those described for polypeptides derived from NANBV cONA sequences contained 
within clones 5-1-1. 32. 35. 36. 1-2. 81. and 9i. discussed supra and infra. 

Encoded within the family of cDNA sequences contained within clones 5-1-1. 32. 35. 36. 81. 91. i-2. 
and the other clones described in Section IV.A. are antigen(s) containing epitopes which appear to be 
unique to HCV: i.e.. antibodies directed against these antigens are absent from individuals infected with 
HAV or HBV. and from individuals not infected with HCV (see the serological data presented in Section 
IV.B ). Moreover, a comparison of the sequence information of these cDNAs with the sequences of HAV. 
H9V. HOV. and with the genomic sequences in Genebank indicates that minimal homology exists between 
these cONAs and the polynucleotide sequences of those sources. Thus, antibodies directed against the 
antigens encoded within the cDNAs of these clones may be used to identify BB-NANBV particles isolated 
from infected individuals. Ir^ add;;ion. they are also useful for the isolation of NANBH agent(s). 

HCV panicles may be isolated from the sera from BB-NAN8V infected indiviouals or from cell cultures 
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Dy any o( tne memoas known .n .no a.,, .nclud.ng (or e.amp.e. lecho.ques oasen c s..e d.scnm.na..on 
sucn as sed.men,at.on or exclus.on me.hoas. or .ecnn.ques bacod on flens..y such as u..racen,r.fugai.on .n 
density gradients, or precipitation w.ih ageo-s such as poiyethyit.ne glycol, or chromatography on a vanetv 
o( rr^ateriais such as anionic or cation.c exchange materials, and materials which i.nd due to hydrophobic 

s ity. as well as aHm.iy columns. During the isolation procedure the presence o( HCV may be detected bv 
i.yfcndizaiion analysis ol the extracted genome, using probes derived Irom the HCV cONAs descnbed 
supra, or by immunoassay (see Section ll.l.) utilizing as probes antibodies directed against HCV am.oens 
encoded wthin the family ol cDNA sequences shown in Figs. 1-32. and also directed against HCV antlnc 
encoded within the overlapping HCV cDNA sequences discussed supra. The antibodies may be mor-- 

>o oclonal. or polycional. and it may be desiraL.e to purify the aniibod.es before their use in the immunoassay 
A punlicaiion procedure for polyclonal antibodies directed against aniigen(s) encoded within clone S-i-i is 
described m Section IV.E: analogous purification procedures may be utilized for antibodies directed against 
other HCV antigens. 

Antibodies directed against HCV antigens encoded within the tar,.iiy-of cDNAs shown in Figs 1-32 as 
15 well as those encoded within overlapping HCV cONAs. which are affixed to solid suppons are useful for "the 
isolation of HCV by immunoaffinity chromatography. Techniques for .mmunoaffinity chromatography are 
known in the art. including techniques ,or affixing antibodies to solid supports so that they retain their 
immunoselective ac. :;y: the techniques may be tnose in which the antibod.es are adsorbed to the support 
(see. tor example. Kurstak in ENZYME IMMUNOOIAGNOSIS. page 31-37). as well as those in which the 
?o ant.bod.os are covalently linked to the support. Generally, the techn.ques are similar to those used lor 
covalent Imk.ng of antigens to a solid support, which are generally described in Section It C however 
spacer groups may be included in the bilunctional coupling agents so that the antigen binding site of the 
antibody remains accessible. 

During the purification procedure the presence of HCV may be detected and-or verified by nucleic acid 
hybridization, utilizing as p.obes polynucleotides derived from the family of HCV cDNA sequences shown in 
Figs. 1-32. us well as from overlapping HCV cONA sequences, described supra. In this case the fractions 
are treated under conditions which would cause the disruption of v.rai particles, for example, with de'ergents 
in the presence of chelating agents, and the presence ol viral nucleic acd determined by hybridization 
techniques described m Section ll.H. Further confirmation that the isolated particles are the agent* which 
30 induce HCV may be obtained by infecmg -.h.mpanzees with the isolated virus particles, fcllowed by a 
determination of whether the symptoms of NANBH result from the infection. 

Viral particles from the purifieo preparations may then be further characterized. The genomic nucleic 
acid has been punfied. Based upon its sensitivity to RNase. and not DNase I. it appears that the virus is 
composed of an RNA genome. See Example IV.C.2.. infra. The strandedness and circularity or non- 
« circulanty can determined by techniques known m the art. including, for example, its visualization by 
electron microscopy, its migration in density gradients, and its sedimentation characteristics. Based upon 
the hybndization of the captured HCV genome to the negative strands of HCV cONAs. it appears that HCV 
may be compnsed of a positive stranded RNA genome (see Section IV.H.I). Techniques such as these are 
described in. lor example. METHODS IN ENZYMOLOGY. In addition, the purified nucleic acid can be 
Cloned and sequenced by known techniques, including reverse transcription since the genomic material .s 
RNA. See. for example. Maniatis (1982). and Glover (1985). Utilizing the nucleic acid derived from the viral 
particles, it is possible to sequence the entire genome, whether or not it is segmented. 

Examination of the homology of the polypeptide encoded within the continuous ORF of combined 
clones t4i through 39c (see Fig. 26). shows that the HCV polypeptide contains regions of homology with 
-5 the corresponding proteins in conserved regions of llaviviruses. An example of this is described in Section 
IV.H.3. This finding has many important ramifications. First, this evidence, in conjunction with the results 
which show that HCV contains a positive-stranded genome, the size of which is approximately lOOOO 
nucleotides, is consistent with the suggestion that HCV is a flavivirus. or flavi-like virus. Generally, flavivirus 
virions and their genomes have a relatively consistent structure and organization, which are known See 
0 Rice et al. (1986). and Brinton. M.A. (1988). Thus, the structural genes encoding the polypeptides C. pre- 
M/M. and E may be located in the 5 -terminus of the genome upstream of clone I4i. Moreover, using the 
companson with other flaviviruses. predictions as to the precise location of the sequences encoding these 
proteins can be made. 

Isolation of the sequences upstream of those in clone I4i may be accomplished in a number of ways 
which, given the information herein, would be obvious to one of skill in the art. For example, the genome 
•walking- technique, may be used to isolate other sequences which are s' to those in clone I4i. but which 
overlap that done: this in turn leads to the isolation of additional sequences. This technique has been amply 
demonstrated infra, in Section IV.A.. For example, also, it .s known that the flaviviruses have conserved 
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ep.iop.s and .eg.ons ot conserve^ n,.ri^.c acd seouonces Po.ynucooi.dos con.nmmg >mo co..s.i,v«d 
sequences may be useo.as proDes wh.ch b.na .he HCV genome, tr-.s aiiow.„g „s .soianon in add.i.on 
these conserved sequences, .n con,unct.on wih mose derived l-om the HCV cDNAs shown ,n F,n 22 ,T,av 
be used to des.gn pr.mers (or use .n systeois wh.ch an^piily ihe genome sequences upstream ot ,nose ,n 
Clone 4.. using polymerase cha.n reaction technology. An example ol Ih.s ,s described mlra 

The structure ol the HCV may also be determined and its components isolated. The morphology and 
s,;e may be determined by. tor example, electron .microscopy. The ident.f.cai.on and locah.auon of speci/ic 
viral polypepl.de ant-gens such as coal or envelope antigens, o. .niernal antigens, such as nucleic acid 
binding prote.ns. corft antigens, and polynucleoi.f-> poiymernse(s) may also be determined by. for example 
determ.n.ng y^helher the antigens are present as major or minor viral components, as well as by utilizing 
annbodies directed agamsi the specific antigens. encoded within isolated cDNAs as probes. This inlormation 
IS useful in the des.gn ol vaccines: for example, it may be preferable to include an exterior antigen in a 
vaccine preparation. Multivalent vaccines may be comprised of. for example, a polypeptide derived Irom the 
genome encoding a structural protem. lor example. E. as well as a polypeptide Irom another portion oi the 
'5 genome, lor example, a nonstructural or structural polypeptide. 
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MX Cell Culture Systems and Animal Model Systems lor HCV Replication 

;o The suggestion that HCV is a llaviv.rus or llav..|,ke virus also provides mlormanon on methods lor 
growing HCV. The term "flav.-like" means that the virus shows a significant amount of homology to the 
known conserved regions of llaviviruses and that the maior.iy of the genome .s a single ORF Methods for 
cullunng flaviv.ruses are known to those ol skill in the art (See. lor example, the reviews by Snnton (1986) 
and Stollar. v. (1980)). Generally, suitable celts or cell lines lor culturing HCV may include those known to 

.-5 suppon Flavmrus replication, for example, the following; monkey kidney cell lines (eg MK2 VERO)- 
porcine kidney cell lines (e.g. PS): baby hamster kioney cell l.nes (e.g. BHK): mur.ne macrophage cell lines 
(e.g.. P388D1. MKl. Mmi): human macrophage cell l.nes (e.g.. U-937): human peripheral blood leukocytes 
human adherent monocytes: hepatocytes or hepatocyte cell, lines (e.g.. HUH7 . HEPGSV embryos or 
embryonic cells (e.g.. chick embryo fibroblasts): c cell lines derived Irom invertebrates, preferably from 

JO insects (e.g. drosophiia cell lines), or more preferably from arthropods, lor example, rr.osquito cell lines 
(e.g.. A. Albopictus. Aeoes aegypti. Cutc^x tritaeniorhynchus) or tick cell lines (e.g. RML-t4 Oermacentor 
parumapertus). 

It is possible that pnmary hepatocytes can be cultured, and then infected with HCV: or alternatively the 
hepatocyte cultures could be derived from the livers ol infected individuals (e.g.. humans or chimpanzees) 
The latter case is an example ol a cell which is infected in vivo being passaged in vitro. In addition various 
immortalization methods can be used to obtain cell-lines derived from hepafoc7te~aritures For example 
pnmary liver cultures (before and after enrichmbnt of the hepatocyte population) may be fused to a variety 
ol cells to maintain stability. For example, also, cultures may be infected with transforming viruses or 
transfected with transforming genes in order to create permanent or semipermanent cell lines. In addition 
40 lor example, cells in liver cultures may be fused to established cell lines (e.g.. HepG2 ). Methods for cell 
fusion are known in the art. and include, lor example, the use of fusion agents such as polyethylene glycol 
Sendai Virus, and Epstein-Barr virus. 

As discussed above. HCV is a Flavivirus or Flavi-like virus. Therefore, it is probable that HCV infection 
of cell lines may be accomplished by techniques known in -.ne art for infecting cells with Flaviviruses These 
*s include, for example, incubating the cells with viral preparations under conditions which allow viral entry into 
the cell. In addition, it may be possible to obtain viral production by transfecting the cells with isolated viral 
polynucleotides. It is known that Togavirus and Flavivirus RNAs are infectious in a variety of vertebrate cell 
lines (Pfetferkom and Shapiro 1974)). and in a mosquito cell line (Peleg (1969)). Methods lor transfecting 
tissue culture cells with RNA duplexes, positive stranded RNAs. and DNAs (including cONAs) are known in 
50 the art. and include, for example, techniques which use electroporation. and precipitation with DEAE- 
Oextran or calcium phosphate. An abundant source ol HCV RNA can be obtained by pertorming in vitro 
transcnption of an HCV cDNA corresponding to the complete genome. Transfection with this mateTiaFS? 
with cloned HCV cDNA should result in viral replication and the in vitro propagation of the virus. 

In addition to cultured cells, animal model systems may be used tor viral replication: animal systems in 
which flaviviruses are known to thos? of skill in the art (See. lor example, the review by Monath (i986)) 
Thus. HCV replication may occur not only in chimpanzees, but also .n. for example, marmosets and 
suckling mica. 
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'I L Scirtomng loi AiUi-Viial Agents lo. HCV 

The ava.iaD.i.ty ol cell culture ana a... -.51 mooei systems lo, HCV also makes possible screemnn .„ 
anii-vrai agents wh.ch .nh.p.t HCV -epiication. ana particularly tor mose agents -h.ch prelerent.aiiv f.io 
cell growth dnd muinpl.cat.on while inh.p.i.ng v.ral replication These screen.nn me.nods are known bl ,l °Z 
01 skill in the ait. Generally, the anti-v.rai agents are tested at a vanety ol concentrations, lor their ellert^n 
preventing viral replication m cell culture systen^s which support viral replication, and then lor an ,nh,bi ion 
ol inlectivity or o( viral pathogenicity (and a low level of toxicity) in an ammai model system 

The methods and compositions provided herein lor delecting HCV antigens and HCV poiynucleoiid«« 
are uselul lor screening ol anti-viral agents m that they provide an alternative, and perhaps more sensitive 
means.-Tor detecting the agent's ellecl on viral replication than the cell plaque assay or iDso assay For 
example, the HCV-polynucleotide probes described herein may be used to quantitate the amount ol viral 
nucleic acid produced in a cell culture. This could be accomplished, for example, by hybnoi^ation or 
competition hybridization of the inlecied cell nucie.c acds with a labeled HCV-poiynudeotide probe For 
example, also. anti-HCV antibodies may. be used to identify and quantiiate HCV ant.gen(s) m the cell culture 
utilizing the immunoassays described herein, in addition, since it may be desirable to quantitate HCV 
antigens m the infected cell culture by a com-eiitior. assay, the polypeptides encoded withm the HCV 
cDNAs described herem are useful m these competition assays. Generally, a recombinant HCV polypeptide 
derived Irom the HCV cDNA would be labeled, and the inhibition ol binding of th.s labeled polypeptide to an 
HCV polypeptide due to the antigen produced In the cell culture system would be monitored iwioreover 
these techniques are panicularly uselul m cases where the HCV may be able to replicate in a cell line 
without causing cell death. 



a \iM^ Preparation ol Anenuated Strains of HCV 

In addition to the above, utilizing the tissue culture systems and or ammai model systems ,v may be 
possible to isolate attenuated strains of HCV These strains would be suitable lor vaccines oi lor the 
isolation of viral antigens. Attenuated strains are isolatable after multiple passages m cell culture and-or an 

30 animal model. Detection of an. attenuated stiam in an infected cell or individual is achievable by techniques 
known in the aa. and could include, for example, the use of antibodies to one or more epitopes encoded m 
HCV as a probe or the use of a polynucleotide containing an HCV sequence of at least about 8 /lucleotides 
as a probe. Alternatively, or in addition, an attenuated strain may be constructed utilizing the genomic 
information ol HCV provided herein, and utilizing recombinant techniques. Generally, one would attempt to 

IS delete a region of the genome encoding, for example, a polypeptide related to pathogenicity, but which 
allows viral replication. In addition, the genome construction would allow the expression of an epitope which 
gives rise to neutralizing antibodies for HCV. The altered genome could then be utilized to transform cells 
which allow HCV replication, and the cells grown under conditions to allow viral replication. Anenuated HCV 
strains are useful not only for vaccine purposes, but also as sources for the commercial production of viral 

0 antigens, since the processing of these viruses would require less stringent protection measures for the 
employees involved in viral production and-or the production of viral products. 



Ml. General f^ethods 

The general techniques used in extracting the genome Irom a virus, preparing and probing a cDNA 
library, sequendno clones, constructing expression vectors, transforming cells, perlorming immunological 
assays such as radioimmunoassays and ELlSA assays, for growing cells in culture, and the like are known 
in the art and laboratory manuals are available describing these techniques. However, as a general guide, 
the following sets forth some sources currently available for such procedures, and for materials useful m 
carrying them out. 



il' A. Hosts and Expression Control Sequences 

55 

Both prokaryotic. arid eukaryotic host cells may be used for expression of desired coding sequences 
when appropriate control sequences which are compatible with the designated host are used. Among 
prokaryotic hosts. E. coli is most frequently used. Expression control sequences lor prokaryotes include 
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promoters, opi.onaiiy coma.n.ng operator pon.ons. and ..Dosome binding sues. Transfer vectors compatible 
w.in prokaryotic hosts are commonly derived I om. lor erampte, pbR322. a plasm.d coma.n.ng operons 
conferring amp.cM.n and leiracychne resistance, and the vanous pUC vectors, wh.ch also coma.n se- 
quences conlernng ant.b.ot.c resistance maiKers. These markers may be used to obta.n successful 
iransiormants by seiect.on. Commonly used prokaryotic control sequences include the Beta-iactamase 
(pen.c.lhnase) and lactose promoter systems (Chang et al. (1977)). the tryptophan (trp) promoter system 
(Goeddel et al. (1930)) and the lambda-denved P, promoter and N gene ribosome binding site (Sh.matake 
et al. (1981)) and the hybrid tac promoter (De Boer et al. (1983)) derived from sequences of the trp and lac 
UV5 promoters. ThB foregoing systems are pan.cularly compatible w,ih E. coli: if desired, other ^karyoli? 
hosts such3rstra.ns of Bacillus or Pseudomonas may be used, with corrTspoSding control sequences 

Eukaryottc hosts include yeast and mammalian cells in culture systems. Saccharomyces cerev.s.ae and 
Saccharomyces carisbergensis are the most commonly used yeast hosts, and are convenient fungal hosts 
Yeast compatible vectors carry markers which permit selection of successful transformants by conferring 
proiotrophy to auxotrophic mutants or resistance to heavy metals on wiid-type strains. Yeast compatible 
vectors may employ the 2 micron origin of replication (Broach et al. (1983)). the combination of CEN3 and 
ARSi or other means for assuring replication, sucn as seqi- ;nces which will result in incorporation of an 
appropriate fragment into the nosi cell genome. Control sequences for yeast vectors are known m the an 
and include promoters for the synthesis of glycolytic enzymes (Hess et al. (1968): Holland et al. (1978)) 
including the promoter for 3 phosphoglycerate kinase (Hitzeman (1980)). Terminators may also be included 
such as those denved from the enolase gene (Holland (I98i)). Particularly useful control systems are -those 
which comprise the glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter or alcohol c^e- 
hydrogenase (ADH) reguiatable promoter, terminators also derived from GAPDH. and if secretion is desired 
leader sequence from yeast alpha factor. In addition, the transcriptional regulatory region and the transcnp- 
t.onai .nitiaiion region which are operabiy Imked may be such that they are not naturally associated .n the 
w.id-iype organism. Ti.ese systems are described in oeia.l .n EPO 120.551. published' October 3 1984 
EPO 116.201. published August 22. 1984; and EPO 164.556. published December 18. 1985. all of wh.ch are 
assigneo to the herein assignee, and are hereby incorporated herein by reference. 

Mammalian cell lines available as hosts fo- expression are known in the art and include many 
immortalized cell lines available from the Ar.eric^n Type Culture Collection (ATCC). including Heta cells 
Chinese hamster ovary (CHO) cells, baby hamster kidney (BHK) cells, and a number of other cell lines. 
Suitable promoters for mammalian cells are also known m the art and include viral promoters such as that 
from Simian Virus 40 (SV40) (Fiers (1978)). Rous sarcoma virus (RSV). adenovirus (ADV). and bovine 
papilloma virus (BPV). fy/1ammalian cells may also require term.nator sequences and poly A addition 
sequences: enhancer sequences which increase expression may also be included, and sequences which 
cause amplification of the gene may also be desirable. These sequences are known in the art. Vectors 
suitable for replication in mammalian cells may include viral replicons. or sequences which insure 
integration of'the appropriate sequences encoding NANBV epitopes into the host genome. 



^ IM.B. Transformations 

Transformation may be by any known method for introducing polynucleotides into a host cell, including, 
for example packaging the polynucleotide in a virus and transducing a host cell with the virus, and by direct 
uptake of the polynucleotide. The transformation procedure used depends upon the host to be transformer 

^5 For example, transformation of the E. coli host cells with lambda-gtl l containing BB-NANBV sequences is 
discussed in the Example section, infra. Bacterial transformation by direct uptake generally employs 
treatment with calcium or rubidium chloride (Cohen (1972). f^aniatis (1982)). Yeast transformation by direct 
uptake may be earned out using the method of Hinnen et al. (1978). Mammalian transformations by direct 
uptake may be conducted using the calcium phosphate precipitation method of Graham and Van der £b 

50 (1978), or the various known modifications thereof. 



in c. Vector Construction 

Vector construction employs techniques which are known m the art. Site-specific DNA cleavage ts 
performed by treating with suitable restnction enzymes under conditions which generally are specified by 
the manufacturer of these commercially available enzymes. In general, about t microgram of plasmid or 
DNA sequence is deaved by i unit of enzyme m about 20 microliters buffer solution by incubation of i-2 hr 
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and tne DNA recovered Dy prec.p..a,.on w„h e.ha..oi Tne -.eaved .ragmen.s r.ay be separated u ' no 
poiyaayiam.de o. agarose goi electrophoresis techn.qucs. according to w.e ge-erai procedures !n S 
Methods .n Enzymology (l£80) 65:499-560. P'OCOd.ires (ound ,n 

Sfcky ended cleavage fragments may be blunt ended us.ng E. col. ONA polymerase i (Kleno«, ,n the 
presence of the appropriate deoxynucleot.de tr.phosphaies (dNTPT) ^sen, .n the mixture. Treatmem J..h 
SI nuclease may .iso'be used, resulting in the hydrolysis o( any single stranded ONA port.ons 

bgai-ons are earned out us.ng standard butler and temperature conditions us.ng T4 DNA ligase and 
ATP: sticky end l.gations require less AT^^ and less ligase than blunt end ligations. When vector fLments 
are used as pan of a ligation mixture, the vector fragment is often treated with bacterial alkaline 
phospRatase (BAP) or calf .ntestinal alkaline phosphatase to remove the s'-phospha.e and thus preveni 
religation of the vector: alternatively, restriction enzyme digestion of unwanted fragments can be used to 
prevent ligation. " 

Ligation mixtures are iranslormed .nto su.tabie clon.ng hosts, such as E. col., and successful iranslor- 
mants selected by. for example, antibiotic resistance, and screened lor the cTrri^ construction. 



I" D Construetion of Desired DNA Sequences 

!0 Synthetic oi.gonucieot.des may be prepared using an automated oi.gonucleo'ide synthesizer as de- 
scribed by Warner (1984). If desired the synthetic Strands may be labeled with «p by treatment w.th 
polynucleotide kinase in the presence of J->-ATP. using standard conditions for the reaction - 

ONA sequences, including those isolated from cONA libranes. may be modified by known techniques 
including, lor example site directed mutagenesis, as described by Zoller (1982). Briefly \fe ONA to be 
:$ mooif.ed is packaged into phage as a s.ngie stranded sequence, and converted to a doubfe siranded ONA 
vv.ih DNA polymerase us.ng. as a primer, a synthetic oligonucleotide complementary to the portion of the 
DNA to be mod.f,ed. and having the desired modification included in .ts own sequence. The resulting 
double stranded DNA is transformed into a phage supporting host bacterium. Cultures of the transformed 
bactena. wh.ch contain replications of ea. i strand of the phage, are plated in agar .o obtain plaques 
Theoretically. 50% of the new plaques contain phage having the mutated sequence, and the remaining 50% 
have the original sequence. fReplicaies of the plaques are hybridized to labeled synthetic probe at 
temperatures and conditions which permit hybridization with the correct strand, but not with the unmodified 
sequence. The sequences which have been identified by hybridization are recovered and cloned 
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III.E. Hybridization with Probe 

DNA libraries may be probed using the procedure of Grunstem and Hogness (1975). Briefly m this 
procedure, the DNA to be probed is immobilized on nitrocellulose filters, denatured, and prehybridized with 
a buffer containing 0-50% formamide. 0.75 M NaCl. 75 mivi Na citrate. 0.02% (wt,v) each of bovine serum 
albumin, pofyvmyl pyrollidone. and Ficoll. 50 mM Na Phosphate (pH 6.5). 0.1% SOS and lOO 
micrograms/ml earner denatured DNA. The percentage of formamide in the buffer, as well as the time and 
temperature conditions of the prehybridization and subsequent hybridization steps depends on the strin- 
gency required. Oligomeric probes which require lower stringency conditions are generaHy used with low 
percentages of formamide. fewer temperatures, and longer hybridization times. Probes containing mere than 
30 or 40 nucleotides such as those derived from cONA or genomic sequences generally employ higher 
temperatures. e.g.. about 40-42 C. and a high percentage, e.g.. 50%. formamide. Following prehybridiza- 
tion. 5 -P-labeied oligonucleotide probe is added to the buffer, and the filters are incubated in this mixture 
under hybridization conditions. After washing, the treated filters are subjected to autoradiography to shew 
the location of the hybridized probe: ONA in corresponding locations on the original agar plates is used as 
'he source of the desired ONA. 
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"l-F Verification of Construction and Sequencing 

For routine vector constructions, ligation .nnixtures are transformed into E. coli strain HBlOl or other 
suitable host, and successful transformants selected by antibiotic resistancroTTther markers Plasn.ids 
from the transformants are then prepared according to the method of Clewell et al. (i969). usually lollowing 
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chiorampnen.col ampi.i.cat.on (Cewe.i (1972), The ONA .5 .so.u.uu ana anaiy.uo. us..n, „y .os.nco.. 
enzyme anaiys.s ana or sequenc.ng. Soquenc.ng nay be hy ir.o .l.deo.y n^omoo of San.jc e. .1 (i977i as 
lurmer Jescnoed by Massing et al (I98i). o, by the moihoa ol Ma.am e. al (i980. P.ob.erm ,,„h oano 
compression, wn.ch are sor^et.mes observed ,n GC r,ch ,ey.o„s. were ove,corn„ oy use o( T 
deazoguanosine according to Sarr ei al. (i986). 

lll.G. Enzymes Linked Immunosorbeni Assay 

The enzyme-linhed .mmunosorbent assay (ELISA) can be used to measure e.mer am.gen or ani.Dody 
concentrations. This method depends upon conjugation of an en.^yme to e.mer an antigen or an aniibodv 
and uses the bound enzyme activity as a quantitative label. To measure antibody, the known antigen is 
fixed to a solid phase (e.g.. a microplaie or plastic cup), mcubaied w.m test serum dHui.ons washed 
■ncuba.ed w.th anti-.mmuncgiobulin labeled with an enzyme, and wasned again. Enzymes suitable lo.' 
labeling are k.nown in the art. and mclod- lor example, horseradish peroxidase. Enzyme activity bound to 
the solid phase is measured by adding the speolic subsi .le. and determmmg product lormat.on or 
substrate utilization colonmetncally. The enzyme activity bouna is a direct lunction of the amount of 
antibody bound. 

To measure antigen, a known specific antibody is lixed to the soi.d phase, the test material conta.n.nq 
antigen <s added, after an mcubation the solid phase -s washed, and a second enzyme-laoeied antibody .s 
added. After washing, .substrate -s added, and enzyme activity .s estimated colonmetricaliy and related to 
antigen concentration. 



IV. Examples 

Described below are examples of the present invention wh.cn are provided only for illustrative purposes 
and not to limit the scope of the present invention. In light of the present disclosure, numerous 
embodiments w.th.n the scope of the cla.mi wilt be apparent to those of ordinary skill in the an The 
procedures set forth, for example, in Sections IV A. may. .f des.red. be repeated but need not be as 
techniques are available for construction of the des.red nucleotide sequences based on the information 
provided by the invention. Expression is exemplified in E. coli: however, other systems are available as set 
.onh more fully .n Section lll.A. Additional epitopes derived from the genomic structure may also be 
produced, and used to generate antibodies as set forth below. 

'V A. Preparation. Isolation and Sequencing of HCV cONA 

IV.A.1. Preparation of HCV cDNA 

The source of NANB agent was a plasma pool derived from a chimpanzee with chrome NANBH The 
Chimpanzee had been experimentally infected with blood from another chimpanzee with chronic NANBH 
resulting from infection with HCV in a contaminated batch of factor 8 concentrate derived from pooled 
human sera. The chimpanzee plasma pool was made by combining many individual plasma samples 
containing high levels of alanine aminotransferase activity: this activity results from hepatic injury due to the 
HCV infection. Since 1 ml of a lO"' dilution of this pooled serum given i.v. caused I^ANBH in another 
Chimpanzee, its CID was at least lO'/ml. i.e.. it had a high infectious virus titer. 

A cDNA library from the high titer plasma pool was generated as follows. Rrst. viral panicles were 
isolated from the plasma: a 90 ml aliquot was diluted with 3iO ml of a solution containing 50 mlvl Tris-HCl 
pH 8,0. imf^ EDTA. 100 mM NaCl. Debris was removed by centritugation for 20 min at 15.000 x g at 20* C 
Viral particles in the resulting supernatant wera then pelleted by centrifugation in a Beckman SW28 rotor at 
28.000 rpm for 5 hours at 20 C. To release the viral genome, the panicles were disrupted by suspending 
the pellets m 15 ml solution containing 1% sodium dodecyl sulfate (SDS). iO mM EDTA. 10 mf^ Tris-HCl 
pH 7.5. also containing 2 mg/ml. proteinase k. followed by incubation at 45'C for 90 min Nucleic acids 
were isolated by adding 0.8 micrograms MS2 bacteriophage RNA as carrier, and extracting the mixture four 
times with a 1:1 mixture of phenol:chloroform (phenol saturated w.ih 0.5M Tns-HCl. pH 7.5. 0.1% (v/v) beta- 
mercaptoethanol. 0.1% (w/v) hydroxyaumolone. followed by extraction two times w.th chloroform The 
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.itiiMKius pM.isi; ^^Ji CKjiKvnuaUiO wu*^ l-Outjn,)i pmji lo piuCi|>.[ai.uti «.in v...i'jfr al..->.. ih , 

ovu.n.gni ai -20 C Nucte-c aca was <ocov«,ea oy cotimiu.jaiiu. .n a BccKman SW4i roioi a\ iixkhi 

loi \)0 min at 4 C ann dissolved m waior irii r,aci Ocon ire.iiea ^.im 0 05»- iv v» a.eihyipyn.. „.i ..,'„! 

aiitociavdO 

Nucleic aoa oDtameo tiy the above proceau»e {<2 microgram-) was denaiured w.ii. i / 
CH.HgOH. cONA was synmesuod usmg ihis cenaiured nucleic ace as lompiate. and was ciu.....i .ni,, „,„ 
EcoRI site of phage lambda-gili us.ng methods described by Huynn (1985). except thai ran.i.n.i |i,„r.n,o 
replaced oligo(cJT» I2-18 durmg me synthesis of the first cDNA sirano by reverse transcnptasb i l..yi... „i 
(19761). The resulting double stranded cDNAs were fractionated according to sue on a SephrtM.«o rA-4H 
column;. ^led material ol approximate mtan si:e 400. 300, 200. and lOO base-pairs wero |MM,in,i „,i.> 
cDNA pools 1. 2. 3. and 4. respeciively. The lamoda-gtli cDNA library was generated from ihi. i ONA m 
pool 3. 

The lambda-gin cDNA library generated from pool 3 was screened for epitopes Ihdi ,..ui.i i^imj 
specifically with serum derived from a patient who had previously experienced NANBH. Aboui ii»^ phn.jn 

wore screened w.ih patient sera usmg the methods of Huynh et al (i985). except that b.i I humon 

antibody was detected with sheep anti-human ig antisera that had Deen radio-iaoeied with '-M r.yp povuvn 
phages were identified and purified. The .ive positive php-^es were then tested for specificity dl i..n.i„„j i,> 
sera from 8 different :,umans previously infected with the NANBH agent, usmg the same meih.»i I'oui r»i 
the phage encoded a polypeptide that reacted immunologically with only one human serum, lu tim unn 
that was used tor primary screening of the phage i.brary. The fifth phage (5-1-1 ) encoded a poiypopnup thai 
reacted immunologically wuh 5 of 8 of the sera tested. Moreover, this polypeptide did noi .oacI **n. 
munologically with sera from 7 normal blood donors. Therefore, n appears mat done 5-1-1 -m onos <i 
polypeptide which is specifically recognized immunologically by sera from NANS patients. 



'^^^ Sequences of me HCV cDNA in Recombinant Phage S-i-i. ana of the Polypeptide EnciM*n'i Wiih.n 
me Sequence. 



The cDNA in recombinant phage 5-i-t was sequenced by me method of Sanger ei c*i fifSV) 
Essoniially. me cDNA was excised with Fcon . isolated by sire fractionation usmg gel electropn..ipc.ti i r.e, 
EcoRl restriction fragments were subcioned mio the M13 vectors. mpi8 and mpi9 (Messing (luiM)) amj 
sequenced usmg the dideoxycham termination method of Sanger et ai. (1977). The sequence Mhiampo 
shown in Fig. 1 . 

The polypeptide encoded. in Fig. 1 that is encoded m the HCV cDNA is m the same transiati»'»t3i framr, 
as the NMerminal bela-galactosidase moiety to wmch it is fused. As shown m Section IV.A.. the iirtMciationai 
open reading frame (ORF) of 5-1-1 encodes epitope(s) specifically recognized by sera from ppimnis nno 
chimpanzees with NANBH infections. 



'V-A.3. Isolation of Overlapping HCV cDNA to cDNA m Clone 5-M. 



Overlapping HCV cDNA to the cONA m clone 5-1-1 was obtained by screening the same laiMlj«ta-gii i 
library, created as described in Section IV. A.I. . with a synthetic polynucleotide derived from the s«.|i»o^co or 
the HCV cONA in clones 5-i-i, as shown in Fig. i. The sequence of the polynucleotide used foi ♦coomng 
was: 

5'-TCC CTT OCT CGA TQT ACQ GTA AGT OCT GAG AGO ACT CTT CCA TCT CAT CGA ACT i . rC QGT 
AGA GGA CTT CCC TGT CAG GT-s'. 

The lambda-gin library was screened with this probe, usmg the method described in Huynii ( 10135) 
Approximately i in 50.000 clones hybridized with the probe. Three clones which contained cONAc wmicm 
hybridized with the synthetic probe have been numbered 81. 1-2. and 9i. 



'V A.4. Nucleotide Sequences of Overlapping HCV cDNAs to cONA m Clone 5- 1 - 1 . 

The nucleotide sequences of the three cDNAs in clones 8i. 1-2. and 91 were determined essoMhpilv at 
m Section IV.A.2. The sequences of these clones relative to the HCV cDNA sequence in pnago ''t-t-i -a 
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snown .0 F,g. 2. wn.ch snows ihe suand encoding mo ooiecioo HCV eo-iope. and wr.-„e me homoloo.es ,n 
me nucieoiioe sequences are .ndicaiod by erticai imes between me sequence*^ 

The sequences o( me cloned HCV cDNAs are h.ghiy nomologous ,n me overtapp.ng regions (see F.n 
2). However, mere are differences .n two regions. Nucieoi.de 67 ,n clone 1-2 .s a mymio.no whereas me 
omer mree clones conta.n a cyt.d.ne res-due in m,s pos.i.on. Ii should be noted, iiowever. mat me same 
amino acid is encoded wnen either C or T occupies this position. 

The second difference is that clone 5-M contains 28 base pa.rs which are not present m the other 
three clones. These base pairs occur at th« start of the cDNA sequence m s-m, and are indicated by small 
letters. Jased on radioimmunoassay data, which .s discussed infra in Section IV.D.. it is possible that an 
HCV epitope may be encoded in mis 28 bp region. 

The absence of the 28 base pairs of 5-1. i from clones 81, 1-2. and 9 1 may mean that the cONA in 
these Clones were denved from defective HCV genomes: alternat.velv. the 28 bp region could be a terminal 
artifact in clone 5-1-1. 

The sequences of small letters m the nucleotide sequence of clones 8i ana 9i simply indicate that 
these sequences have not been found m other cONAs because cDNAs overlapping these regions were not 
yet tsoiated. 

A composite HCV cDNA sequence derived from overlapping cDNAs m clones 5-1-1. 8i. 1-2 and 9i is 
snown in Fig. 3. However. ,n m.s figure the unique 28 base pairs of clone 5-1-1 are omitted. The figure also 
shows the sequence of me polypeptide encoded within me ORF of the composite HCV cONA. 



'V A.S. Isolation of Overlapping HCV cDNAs to cDNA m Clone 8i . 

The isolation of HCV cDNA sequences upstream of. and wh.cn overlap those .n clone 8i cDNA was 
accomplished as follows. The lambda-giii cDNA library prepared as described ..i Section iv.A.i. was 
screened by hybridization with a synthetic polynucleotide probe wh.ch was homologous to a s'termmal 
sequence of clone 6*.. Tne sequence of clone 8i is presented .n F,g. 4. The sequence of the synthetic 
polynucleotide used for screening was: 

5' CTG TCA GOT ATG ATT GCC GGC TTC CCG GAC 3'. 

The methods were essentially as described in Huynh (i985). except that the library filters were given two 
washes under stringent conditions, i.e.. the washes were m 5 x SSC. 0.1% SDS at 55'C for 30 minutes 
each. Approximately 1 .n 50.000 clones hybridized with the probe A positive recombinant phage which 
contained cDNA which hybridized with the sequence was isolated and purified. This phage has been 
numbered clone 36. 

Downstream cONA sequences, which overlaps the carboxyl-end sequences m clone '8i cONA were 
isolated using a procedure similar to that for me isolation of upstream cONA sequences, except mat a 
synthetic oligonucleotide probe was prepared which is homologous to a 3' terminal sequence of done 8i. 
The sequence of the synthetic polynucleotide used for screening was: 

5' TTT GGC TAG TGG TTA GTG GGC TGG TGA CAG 3' 

A positive recombinant phage, which contained cDNA which hybridized with this latter sequence was 
isolated and purified, and has been numbered clone 32. 



'V A.6. Nucleotide Sequence of HCV cDNA in Clone 36. 

The nucleotide sequence of the cONA in clone 36 was determined essentially as described in Section 
IV.A.2. The double-stranded sequence of this cDNA. its region of overlap with the HCV cDNA in clone 81, 
and the polypeptide encoded by the ORF are shown in Fig. 5. 

The ORF in clone 36 is in the same iranslational frame as the HCV antigen encoded in clone 81. Thus, 
in combination, me ORFs in cfones 36 and 81 encode a polypeptide that represents pan of a large HCV 
antigen. The sequence of mis putative HCV polypeptide and the double stranded DNA sequence encoding 
It. which IS derived from the combined ORFs of the HCV cDNAs of clones 36 and 81. is shown in Fig. 6. 
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>V A.r Nucioot.cle ScQuenCtfS o» hCV cDNA in Clone 32 

Tho nucieohOe seouence ol the cDNA .n clone 32 was detcrmmed essftntiaiiy as was ihai described .n 
Section IV.A.2 lor me sequence o» clone 5-i-i. The sequence data indicated that the cDNA .n clone 32 
recot.ibinam phage was derived from two ri,ffereni sources. One fragment of the cDNA was comprised ol 
418 nucieoiides derived Irom the HCV genome: the other fragment was comprised of 172 nucteotioes 
derived from the baciencphage MS2 genome, which had been used as a earner durmg the preparation of 
the lambda gti i plasma cDNA library. 

The sequence of the cDNA m clone 32 co. espondmg to that of the HCV genorr^e is shown in rig. > 
The region of the sequences thai overlaps that of done 8t. and the polypeptide encoded by the OftF are 
also- indicated m the figure. This sequence contains one continuous ORF that is .n the same transianonal 
frame as the HCV antigen encoded by clone 8v 



'V A.8 isolation cf Overlapping HCV cDNA to cDNA in Clone 36 

The isolation of HCV ^DNA sequences upstream of. and which overlap those in clone 36 cDNA was 
accomplished as described in Section IV.A.S. for those which overlap clone 8i cDNA. except that the 
synthetic polynucleotide was based on the S'-region of clone 36. The sequence of the svnihetic poly- 
nucleotide used for screening was; 

5* AAG CCA CCG TGT GCG CTA GGG CTC AAG CCC 3* 

Approximately i m 50.000 clones hybndized with the probe. The isolated, purified done of recombinant 
phage wmcn contained cONA which hybridized to this sequence was named clone 35. 



'V.A.9 Nucleotide Sequence cf HCV cDNA m Clone 35 

The nucleotide sequence of me cDNA m clone 35 was determined essentially as described in Section 
IV.A.2. The sequence, its region of oveilap with that of the cDNA m clone 36. and the putative polypeptide 
encoded therein, are shown in Fig. 8. 

Clone 35 apparently contains a single, continuous ORF that encodes a polypeptide in the same 
transiational frame as that encoded by done 38. clone 8i. and clone 32. Fig. 9 shows the sequence of the 
long continuous ORF that extends through clones 35. 36. 81. and 32. along with the putative HCV 
polypeptide encoded therein. This combined sequence has been confirmed using other independent cDNA 
clones derived from the same lambda gtl 1 cDNA library. 



'V.A.iO. Isolation of Overlapping HCV cDNA to cONA in Clone 35 



The isolation of HCV cDNA sequences upstream of. and which overlap those in clone 35 cDNA was 
accomplished as described in Section IV.A.8. for those which overlap done 36 cDNA. except that the 
synthetic polynucleotide was based on the s'-region of done 35. The sequence of the synthetic poly- 
nucleotide used for screening was: 

5' CAG GAT GCT GTC TCC C6C ACT CAA CGT 3' 

Approximately 1 in 50.000 clones hybridized with the probe. The isolated, purified done of recombinant 
phage which contained cDNA which hybridized to this sequence was named clone 37b. 



'V.A.11. Nucleotide Sequence of HCV in Clone 37b 

The nucleotide sequence of the cDNA in clone 37b was determined essentially as described in Section 
IV.A.2. The sequence, its region of overlap with that of the cDNA m clone 35. and the putative polypeptide 
encoded therein, are shown in Fig. iO. 

The 5 -terminal nucleotide of clone 35 is a T. whereas the corresponding nucleotide m clone 37b is an 
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A. The cDNAs from tnree oiher .ndepenaoni clones which were isolaiod Ourmg the proceOure wh.ch 
clone 37b was isolated, described m Sectton IV.A.IO. navo also boon sequenced. The cDNAs from these 
Clones also contain an A m ihis pos.non. Thus, the s'-iermmai T m clone 35 may oe an artefact of the 
cloning procedure. U is known thai artefacts often arise at the s'-iormmi of cDNA molecules. 

Clone 37b apparently contains one continuous ORF which encodes a polypeptide which is a conimu- 
anon of the polypeptide encoded m the ORF which extends through the overlapping clones 35. 36. 8i and 



'V A.12 Isolation of Overlapping HCV cDNA to cDNA m Clone 32 



The isolation ol HCV cDNA sequences downstream ol clone 32 was accomplished as follows. First, 
clone cia was isolated utilizing a synthetic hybridization probe which was based on the nucleotide senuence 
of the HCV cDNA sequence in clone 32. The method was essentially that described m Section :*.A.5. 
except that tne sequence of the synthetic probe was: 
5* AGT GCA GTG GAT GAa CCG GCT GAT AGC CTT 3 . 



Utilizing the nucleotide sequence from- clone cia, another synthetic nucleotide was synthesized which 
had the sequence: 

5* TCC TGA GGC GAC TGC ACC ACT GGA TAA GCT 3* 

Screening of the larnbda gtn library using the done da derived sequence as prooe yielded -oproximateiy 
1 in 50.000 positive colonies. An tsoiated. purified done which hybridized with this prooe was named done 
33b. 



IV A.T3 Nucleotide Sequence of HCV cDNA m Done 33b 

The nucleotide sequence of the cONA in done 33b was determined essentially as described m Section 
IV.A.2. The sequence, its region of overlap with that of the cONA in clone 32. and the putative polypeptide 
encoded therein, are shown in Fig. ii. 

Clone 33b apparently contains one continuous ORF which is an extension of the ORFs in overlapping 
clones 37b. 35. 36. 8i and 32. The polypeptide encoded in clone 33b is m the same translaiional frame as 
that encoded in the extended ORF of these overlapping clones. 



'V A.14 Isolation of Overlapping HCV cDNAs to cDNA Clone 37b and to cDNA m Clone 33b 



In order to isolate HCV cDNAs which overlap the cDNAs in clone 37b and in clone 33b. :he following 
synthetic oligonucleotide probes, which were derived from the cDNAs in those clones, were used to screen 
the lambda gtll library, using essentially the method described in Section IV.A.3. The probes used were: 

5' CAG GAT GCT GTC TCC CGC ACT CAA CGT C 3* 
and 

5* TCC TGA GGC GAC TGC ACC AGT GGA TAA GCT 3' 

to detect colonies containing HCV cDNA sequences which overlap those in clones 37b and 33b. 
respectively. Approximately l in 50.000 colonies were detected with each probe. A clone which contained 
cONA which was upstream of. and which overlapped the cDNA m clone 37b. was named clone 40b. A clone 
which contained cDNA which was downstream of. and which overlapped the cDNA in clone 33b was named 
clone 25c. 



'V A.15 Nucleotide Sequences of HCV cDNA m clone 40b and m clone 25c 

The nucleotide sequences of the cDNAs m cionp 40b anc m clone 25c were determined essentially as 
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UciCDc.! m S»C..On IV A 2 Tha sequences ol 40t) ana -5c. ....... .o.j..„„ „„, ..DNAc 

Clones 37t) and 33d. and ihe putai.ve poiypopi.oes o.icodG.i (i.om..,, a.*- s.v.«n ,„ F=u i? (rion- 40b, ana 
P'lj 13 (Clone 25c). 

The 5 -.ernr,.nal nudeoMde o( clone 40b -s a G. Ho„evo.. .h„ cDNAs i.o-n i.v„ .noepenaem clone- 
s wh.cr- were -soiaied Ounng the procedure .n wh.ch done 40b was .soia.ea. dcsched .n Sect.on IVA m" 
have also been sequenced. The cONAs Irom these clones also conia.n a T .n in.s pos.i.on Thus, ihe G ma« 
represent a cloning artilac (see tne discussion m Section IV A.i 1). 

The 5*-ierminus ol clone 25c is ACT. bui the sequence ol ih.s legion ,n done da (sequence not 
Shown,, and in clohB 33b is TCA. Th.s d.Mer jnc. may also reprosent a clomng ari.lact. as may Ihe 28 extra 
'0 5 •terminal OdCleotides in clone S-to. 

Clones 40b and 25c each apparently contain an ORF which is an extension ol the continuous ORF ,n 
Ihe previously sequenced clones. The nucleotide sequence ol the ORP extending through clones 40b 37b 
35. 36. 81. 32. 33b. and 23C. and the ammo acid sequence ol the putative polypeptide encoded therein are 
Shown m Fig. 14. In the figure, the ootent.al an.lacts have been om.iied from the sequence, and .nsiead' the 
'S corresponding sequences m non-5 -terminal regions ol multiple overlapping clones are shown. 

'^ ^ '6 Preparation ol a Composite HCV cDNA Irom the cONAs m Clones 36. St. and 32 

•0 The composite HCV cDNA. CiOO. was cor^siructed as Icllows. First the cDNAs Irom the clones 36 8i 
and 32 were excised with EcoRl. The Ecofll Iragment ol ;DNA from each done was cloned individually' into 
the EcoRl site of the vector pGEI^3-bluc (Promega Biotec). The resulting recombinant vectors which 
contained the cDNAs fro.m clones 36. 8t. and 32 were named pGEM3-Diue 36. pGEI^3-blue/8i and 
pGEM3-blue 32. respectively. The appropriately oriented recombinant of pGEM3-oiue/8i was digested w.th 

5 Naei and Nar!. and the large (-2850DP) Iragmeni was purified and ligated w.th the small (-570bp) Nael Narl 
puMlied restriction Iragment Irom pGEf^3-biue 36. This composite ol the cONAs irom clones 36 and 8i was 
used to generate another pGEM3-blue vector containing the continuous HCV ORF contained within the 
overlapping cDNA within these clones. This new piasmid was then digested w.th Pvull and EcoRl to release 
a fragment of approximately 680bp. which was ; .en ligated with the small (580bp) PvulLEcoRI fragment 

0 isolated from the appropriately oriented pGEIvl3-biue.'32 plasmid. and the composite cONA from clones 36 
81 and 32 was l.gated into the EcoRl linearized vector pSODdl. which is described in Section IV 8 i and 
wh.ch was used to express clone S-i-i in bacteria. Recombinants containing the •i270bp Ecofll fragment of 
composite HCV cDNA (C-so) were selected, and the cDNA from the plasmids was exdsed with Ecofll and 

purified. 
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^^■^•^ 'so'at'Oh and Nucleotide Sequences of HCV cDNAs in Clones I4i. lib. 7f. 7e. 8h. 33c. I4c. 8;. 33f. 



-.0 The HCV cDNAs .n clones Ui. nb. 7f. 7e. 8h. 33c. I4c. 8f. 33f. 33g. and 39c were isolated by the 
technique of isolating overlapping cDNA fragments from the lambda gtll library of HCV cDNAs described 
in Section iV.A.t.. The technique used was essentially as described m Section IV.A.3.. except that the 
probes used were designed from the nucleotide sequence of the last isolated clones from the 5 and the 3' 
end of the combined HCV sequences. The frequency of clones which hybridized with the probes described 
5 below was approximately l In 50.000 in each case. 

The nucleotide sequences of the HCV cDNAs in clones I4i. 7|. 7e. 8h. 33c. I4c. 8f. 331 33g and 39c 
were determined essentially as described in Section IV.A.2.. except that the cDNA excised from these 
phages were substituted for the cDNA isolated from clone 5-i.i. 

Clone 33c was isolated using a hybridization probe based on the sequence ol nucleotides in clone 40b 
I The nucleotide sequence of clone 40b is presented in Fig. 12. The nucleotide sequence of the probe used 
to isolate 33c was: 

5' ATC AGG ACC GGG GTG AGA ACA A IT ACC ACT 3' 

The sequence of the HCV cONA in clone 33c. ano the cveriap with that m done 40b. is shown in Rg. 15. 
which also shows the ammo acids encoded therein. 

Clone 3h was isolated using a probe based on the sequence of nucleotides in done 33c. The 
nucleotide sequence of the probe was 
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5 AGA GAC AAC CAT GAG GTC CCC GGT GTT C 3 



The sequence of ihe HCV cDNA .n clone 3h. and the Overlap win thai .n done 33c. ana the am.no acos 
encoded therein, are shown in Fig. i6. 

Clone 7e was isolated us.ng a probe based on the sequence ol nucleotides .n clone 8h. The nucteot.de 
sequence of the probe was 

5" TCG GAC CTT_TAC CTG GTC ACG AGG CAC 3". 

The sequence of HCV cDNA in clone 7e. the overlap w.th done 8h. and the am.no acios encoded therein 
are shown in Fig. 1 7. 

Clone 14c was .sotated w.th a probe based on the sequence of nucleotides .n clone 25c. The sequence 
ol clone 25c is shown .n F.g. i3. The probe in the isolation of done Mc had the sequence 

5' ACC TTC CCC AT- AAT GCC TAC ACC ACG GGC 3*. 

The sequence of HCV cDNA .n clone Uc. its overlap w.th that ,n clone 25c. and the am.no acds encoded 
therein are shown m Fig. 18. 

Clone Sf was isolated using a probe based on me sequence of nucleotides m clone Mc. The nucieoi.de 
sequence of the probe was 

5* TCC ATC TCT CAA GGC AAC TTG CaC CGC TAA 3*. 

The sequence of HCV cDNA m clone 3f. .ts overlap w.tn that m clone Uc. and the ammo acids encoded 
inerein are shown in Fig. 19 

Clone 33f was isolated using a probe based on the nucleotide sequence present m clone 8( The 
nucleotide sequence of the probe was 

S' TCC ATG GCT GTC CGC TTC CAC CTC CAA AGT 3*. 

The sequence of HCV cDNA in clone 33f. its overlap with that in clone 8f and the ammo acids encoded 
therem are shown in Fig. 20. 

Clone 33g was isolated using a probe based on the sequence of nucleotides m done 33f. The 
nucleotide sequence of the probe was 

5' GCG ACA ATA CGA CAA CAT CCT CTG AGC CCG 3'. 

The sequence of HCV cONA in done 33g. its overlap w.th that .n done 33f. and the am.no acids encoded 
therein are shown in Fig. 2i. 

Clone 7f was isolated usmg a probe based on the sequence of nucleotides m clone 7e. The nucleotide 
sequence of the probe was 

5* AGC AGA CAA GGG GCC TCC TAG GGT GCA TAA T 3*. 

The sequence of HCV cDNA m clone 7f. its overlap with clone 7e. and the ammo acids encoded therein are 
shown in Rg. 22. 

Clone nb was isolated using a probe based on the sequence of done 7f. The nucleotide sequence of 
the probe was 

5' CAC CTA TGT TTA TAA CCA TCT CAC TCC TCT 3'. 

The sequence of HCV cDNA m done lib. its overlap with done 7f. and the ammo acios encoded therein 
are shown in Fig. 23. 

Clone I4i was isolated usmg a probe based on the sequence of nucleotides m clone np. The 
nucleotide sequence of the probe was 

5* CTC TGT CAC CAT ATT ACA AGC GCT ATA TCA 3* 
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The sequence ct HCV cDNA .n clone u,. ,is overlap .v.ih iih ,inc :hc s.-tmho dcJs encooeo iherc.n are 
snown in Fig. 24. 

Clone 39c was fsoiaied usmg a probe oased on the iequonce or nucleotides rp c.ooc 33q The 
rudeoitde sequence of me proDe was 

5' CTC GTT GCT ACQ TCA CCA CAA TTT GGT GTA 3 

The sequence of HCV cONA in done 39c. -s overlap w.th done 33g. and the am.no ac.ds encoded therein 
are shown m Fig. 25. 



'^•^•^Q- Composite HCV cDNA Sequence Derived from Isolated Clones Containing HCV cDNA 

The HCV cDNA sequences m the isolated clones described supra hove been aligned to create a 
composite HCV cDNA sequence. The ir-^laied clones, aligned m the 5* to 3' direction are t4t 7f 7e 8h 
33c. 40t). 37b. 35. 2*^ ^1. 32. 33b. 25c. ^4c. 8f. 33.'. 33g. and 39c. 

A composite HCV cDNA sequence derived from the isolated clones, and the ammo acids encoded 
therein, is shown m Fig. 26. 

In creating the composite sequence the following sequence heterogeneities have been considered. 
Clone 33c contains an HCV cDNA of 800 base pairs, which overlaps the cDNAs in dones 40b and 37c- In 
done 33c. as well as in 5 other overlapping clones, nudeotrde #789 is a G. However, in done 37b (see 
Section iV.A.ii), the corresponding nucleotide is an A. This sequence difference creates an apparent 
heterogeneity tn the amino acids encoded therein, which would be either CYS or TYR. for G or A. 
respectively. This heterogeneity may have important ramifications m terms of protein folding. 

Nucleotide residue 02 m clone 8h HCV cDNA is a T. However, as shown infra, the corresponding 
residue m clone 7e is an A; moreover, an A in this position ts also found in 3 o:hef isolated overlapping 
clones. Thus, the T residue in clone 8h may represent a cionmg amfact. Therefore, in Fig. 26. the residue m 
this position is designated as an A, 

The 3 -terminal nucleotide m done 8f HCV cDPJA is a G. However, the corresponding residue m clone 
33f. and in 2 other overlapping clones is a T. Therefore, m Fig. 26. the residue in this position is designated 
as a T. 

The 3 -terminal sequence in clone 33f HCV cDNA »s TTGC. However, the corresponding sequence m 
done 33g and in 2 other overlapping clones is ATTC. Therefore, in Fig. 26. the corresponding region is 
represented as ATTC. 

Nucleotide residue #4 m clone 33g HCV cDNA is a T However, in done 33f and in 2 other overlapping 
dones the corresponding residue is an A. Therefore, in Fig. 26. the corresponding residue is designated as 
an A. 

The 3 -terminus of done I4i is an AA. whereas the corresponding dinudeoiide in clone nb. and in 
three other clones, is TA. Therefore, in Fig. 26. the TA residue is depicted. 
The resolution of other sequence heterogeneities is discussed supra. 

An examination of the composite HCV cDNA indicates that it contains one large ORF. This suggests 
that the viral genome is translated into a large polypeptide which is processed concomitant with, or 
subsequent to translation. 



IV.A.19, Isolation and Nucleotide Sequences of HCV cDNAs m Clones i2f. 35f. i9g, 26g. and i5e 

The HCV cDNAs in clones i2f. 35f. i9g. 26g. and iSe were isolated essentially by the technique 
described in Section IV.A.17. except that the probes were as indicated below. The frequency of dones 
which hybridized with the probes was approximately 1 in 50.000 in each case. The nucleotide sequences of 
the HCV cONAs in these clones were determined essentially as described in Section IV.A.2.. except that the 
cONA from the indicated dones were substituted for the cDNA isolated from clone 5-1-1. 

The isolation of done i2f, which contains cDNA upstream of the HCV cDNA in Fig. 26. was 
accomplished using a hybridization probe based on the sequence of nucleotides in clone I4i. The 
nucleotide sequence of the. probe was 



5 TGC TTG TGG ATG ATG CTA CTC ATA TCC CAA 3' 



EP 0 318 216 Al 



The HCV cONA sequence of clone i2l. us overlap with done i4i. ana mo am.no acds encoaeo mo.om are 
shown m Fig. 27. 

Tr.a isolation of clone 351. which contains cDNA downsirearr of the HCV cDNA m F.g. 26. was 
5 accomplished using a hybridization probe based on iho sequence of nucieonaes m clone 39c The 
nucleotide sequence of the probe was 

5* AGC AGC GGC GTC AAA AGT GAA GGC TAA CTT 3*. 

'0 The sequence of clone 35f. its overlap w.ih the sequence in done 39c. and the ammo acds encoded 
therein are shown in Fig. 28. 

The isolation of done i9g was accomplished using a hybridization prcje based on the 3 sequence of 
done 35f. The nucleotide sequence of the probe was 

5 5' TTC TCG TAT GAT ACC CGC TGC TTT GAC TCC 3*. 

The HCV cONA seouence of done i9g. its overlap w.th the sequence .n clone 35f. and the amino acds 
encoded therein are shown m Fig. 29. 

The isolation of clone 26g was accomplished using a hybndization probe based on the 3' sequence of 
V done I9g. The nucleotide sequence of the probe was 

S' TGT GTG GCG ACG ACT TAG TCG TTA TCT GTG 3'. 

The HCV cDNA sequence of clone 25y. its overlap w.th the sequence .n done i9g. and the ammo 
5 acids encoded therein are shown in Fig. 30. 

C:one i5e was isolated using a hybridisation probe based on the 3* sequence of done 26 g. The 
nucleotide sequence of the probe was ^ 

5 CAC ACT CCA GTC AAT TCC TGG CTA GGC AAC 3*. 

The HCV cDNA sequence of clone I5e. its overlap w,th the sequence in clone 26g. and the ammo acids 
encoded therein are shown in Fig. 3i. 

The clones described in this Section have been deposited with the ATCC under the terms and 
conditions described in Section II.A.. and have been assigned the following Accession Numbers. 



lambda-gti 1 


ATCC 


Deposit Date 




No. 




clone l2f 


40514 


10 November 1988 


done 35f 


40511 


10 November 1988 


clone I5e 


40513 


10 November 1988 


clone k9*i 


40512 


10 November 1988 



The HCV cDNA sequences in the isolated clones described- supra, have been aligned to create a 
composite HCV cDNA sequence. The isolated clones, aligned in the s' to 3' direction are: i2f. I4i, 7f. 7e. 
8h. 33c. 40b. 37b. 35. 36. 31. 32. 33b. 25c. 14c. 8f 33f. 33g. 39c. 35f. I9g. 26g. and i5e. 

A composite HCV cDNA sequence derived from the isolated clones, and the amino acids encoded 
therein, is shown in Fig. 32. 



'^^^•^Q- Alternative f^ethod of isolating cDNA Sequences Upstream of the HCV cDNA Sequence in Clone 

Based on the most 5 HCV sequence in Fig. 32. which is derived from the HCV cDNA in done i2f. 
small synthetic oligonucleotide primers of reverse transcriptase are synthesized and used to bind to the 
corresponding sequence in HCV genomic RNA. to prime reverse transcription of the upstream sequences. 
The primer sequences are proximal to the known s'-terminal sequence of done I2f. but sufficiently 
downstream to allow the design of probe sequences upstream of me pnmer sequences. Known standard 
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momoos 01 pnmrng ann clon.ng arc usuo Tno .esull.ng cDNA l.orar.ot are scveur.ucj w.ih Soguence. 
upsuoam 01 the pr.m.ng s.ies (as deducoc irom mo oiucaaiea seguonce .r. dont, The nCv gcru m'r 
RNA .s Obtained from e.lher plasma oi l.vei sampios Irom cMimparu-eus «.lh NANBh. o/ from anaion^n.' 
samples from humans wiin NANBH. 

IV^ Alternaiive Memod Ut.l.z.ng Ta.i.ng to Isolate Sequence s from the S'-Termmal Req.on of the HCv 
Genome 

In order to isolate the extreme S'-iermmai sequences of the HCV RNA genome, the cDNA product ot 
the first round of reverse transcnpiion. which .s duplexed with the template RNA. .s tailed with ol.go C This 
.s accomplished by .ncubaiing the product with term.nai transferase m the presence cf CTP The second 
round of cONA Synthesis, wh.ch yields the complement of the first strand of cONA. is accomplished utiiu.ng 
ol.go G as a pnmer lor the reverse transcriptase reaction. The sources of genomic HCV RNA are as 
described in Section IV.A.20. The methods for tailing with terminal transferase, and lor the reverse 
transcriptase reactions are as m Mamatis e! al (i982). "he cONA products are then cloned screened and 
sequenced. 
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Genome ^'^^'"^^""^ ^^^^^^ ^^'''"^ !2 '^^'^'^ Sequences from me 3'-Terminai Reg.on of the HCV 



This method .s based on previously used methods lor ciomr.g cDNAs of Flavivirus RNA. In ih.s method, 
the RNA is subjected to denatunng conditions to remove secondary structures at the s'-termmus and is 
r5 then tailed with Poly A polymerase using rATP as a substrate. Reverse transcnpticn of the poly A tailed 
RNA is catalyzed by reverse transcriptase, utilctng ohgo oT as a pnmer The second strands oi cDNA are 
synthesized, the cDNA' products are doned. screened, and sequenced. 

30 IV.A.23 Creation of Lamoda-gti 1 HCV cDNA Libraries Containing Larger cDNA Inserts 

The method used to create and screen the Lambda gtn libraries are essentially as described in 
Section IV. A. 1.. except that the library is generated from a pool of larger size cDNAs eluted from the 
Sepharose CL-4B column. 

5 

'V A-^^- Creation of HCV cDNA Libranes Using Synthetic Oligomers as Primers 

New HCV cDNA libraries have been prepared from the RNA derived from me infectious chimp?nzee 
) plasma pool described in Section IV.A.1.. and from the poly A* RNA fraction derived from me liver of this 
infected animal. The cDNA was constructed essentially as described by Gubier and Hoffman (1983). except 
that the primers for the first cONA strand synthesis were two synthetic oligomers based on the sequence of 
the HCV genome described supra. Primers based on the sequence of clone \ \tian<i 7e were, respectively. 

5' CTG GCT TGA AGA ATC 3' 
and 

5' AGT TAG GCT GGT GAT TAT GC 3*. 

The resulting cDNAs were cloned into lambda bacteriophage vectors, and screened with various other 
synthetic oligomers, whose sequences were based on the HCV sequence in Rg. 32. 

INAB- Expression of Polypeptides Encoded Withm HCV cDNAs and identification of the Expressed Products 
as HCV Induced Antigens. 

'V B.i Expression of :he Poivceptide Encoded in Clone 5-i-i. 
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The HCV poiypopi.ae encoded w.ihm clone 5-m {see S«C!.un iv.a 2 swprai ^as oxprussttO as a 
lus.on polypeptide w.ih superoxide disnnuost (SOD). This was accompi.sneo by subdon.MQ ine clone 5.I., 
cDNA .nseri into me expression vector pSODcM (Sioimer ei al. (1986)) as lollops 

First. DNA isolated from pSODcli was ireatod wun Bamm ana EcoRi and me 'oiiow.nrj i.f.ker was 
iigated mto the linear DNA created by the restriction en^yrr^ec: 

5 ' GAT CCT GGA ATT CTG ATA A 3 ' 

3* GA CuT TAA GAC TAT TTT AA 5' 

After cloning, the plasmid containing the insert was isolated. 

Piasmid containing the insert was restricted w.ih EcoRI. The HCV cDNA insert m clone 5-1-1 was 
excised w.th EcoRl. and Iigated into this EcoRi linearized plasmid DNA. The DNA mixture was used to 
transform E. coli strain Di2i0 (Sadler et al. (1980)). Recompinants w.ih me "-i-i cDNA m the correct 
orientation (or expression of the ORF sho^n m Ftg. 1 were identified by resir.ct.on mapping and nucleotide 
sequencing. 

Recombinant bactena from one done were induced to express the SOD-NANBs polypeptide by 
growing the bactena in the presence of IPTG. 



'V S. 2. Expression of the Polypeptide Encoded in Clone 81 . 

The HCV cDNA contained withjn clone 81 was expressed as a SOD-NANBa- fusion polypoptide. The 
method for preparing the vector encoding tms fusion polypeptide was analogous to tnai used: 'or the 

creation of the vector encoding SOD-NANB* . except that the source of the HCV cDNA was clone 81. 

which was isolated as described in Section IV.A.3. and for which the cDNA sequence was deterrriined as 
described in Section IV.A.4. The nucleotide sequence of the HCV cONA in clone 81, and the putaiiv^ amino 
acid sequence of the polypeptide encoder" tl.orem are shown in Fig. 4. 

The HCV cONA insert in clone 81 was excised with EcoRI, and Iigated into the pSOOcfl which 
contatnea the linker (see IV.B.l.) and which was linearized by treatment with EcoRI. The DNA mixture was 
used ic transform E. coli strain Di2l0. Recombinants with the clone 81 HCV cDNA in the correct orientation 
for expression of the ORF shown in Fig. 4 were identified by restriction mapping and nucleotide 
sequencing. 

Recombinant bacteria from one clone were induced to express the SOD-NANBs^ polypeptide by 
growing the bacteria in the presence of IPTG. 



iCentification of the Polypeptide Encoded Within Clone 5-1-1 as an HCV and NANBH Associated 
Antigen. " 

The polypeptide encoded within the HCV cDNA of clone 5-1-1 was identified as a NANBH associated 
antigen by demonstrating that sera of chimpanzees and humans infected with NANBH reacted im- 
munologicaliy with the fusion polypeptide. SOD-NANBs-i-t. which is comorised of superoxide dismutase 
at its N-terminus and the in-frame 5-1-1 antigen at its C-ierminus. This was accomplished by "Western" 
blotting (Towbin et al. (1979)) as follows. 

A recombinant strain of bacteria transformed with an expression vector encoding the SOD-NANBs-i 
polypeptide, described in Section IV.B.l.. was induced to express the fusion polypeptide by growth in the 
presence of IPTG. Total bacterial lysate was subjected to electrophoresis through polyacrylamide gels m 
the presence of SOS according to Laemmli (1970). The separated polypeptides were transferred onto 
nitrocellulose fillers (Towbin et al. (1979). The filters were then cut into thin strips, and the strips were 
incubated individually with the different chimpanzee and human sera. Bound antibodies were detected by 
further incubation with '2S|.|apeled sheep anti-human Ig. as described in Section IV.A.1. 

The characterization of the chimpanzee sera used for the Western blots and the results, shown in the 
photograph of the autoradiographed strips, are presented in Fig. 33. Nitrocellulose strips contairing 
polypeptides were incubated with sera derived from chimpanzees at different times during acute NANBH 
(Hutchinson strain) infections (lanes 1-16). hepatitis A infections (lanes 17-24. and 26-33). and hepatitis B 
infections (lanes 34-44). Lanes 25 and 45 show p^osttive coritrois in which the immunobiois were incubated 



*..n se.um from (he pat.ani used to .aeoMy ,he -ecomb.nan, :,one 5-.-I ,n me ong.nai sc^eenmn ai ,n» 
lamoda-giii cONA iiorary (see Section iv.A i ). ^ ^ 

The band v.sibia m ,he control la. es. 25 and 45. .n F,g 23 ,e(iec>s me b.nd.ng of ani.ood.e- ,0 me 
NANBs-.-. morety 0. me SOD (us.on poiypepl.de. These an.ibod.es do nci e.n.b.t b.ndmg -.o SOD 11 
5 s.nce th.s has also been .ncludea as a negative control ,n ,hese samples, and would have appeared L a 
band migrating s.g.i.f.cantly (aster than the SOD-NANBi-, -, (us.on polypepi.de. 

Lanes I- 16 of Fig. 33 show the b.nd.ng of antibodies in sera samples of 4 chimpanzees; the sera wer« 
obtained just pr.or to infection wim NANBH. and sequentially during acute .niection. As seen from me fioure 
whereas ani.bod.es which reacted im-^unolog.cally with me SOD-NANB^-, - polypeptide were absent ,n 
.0 sera samples obtained before administration o( inlectious HCV inoculum and during the early acute phase 
01 .n(ect.on. all 4 animals eventually induced circulating antibodies to this polypeptide during me tate pan o( 
or fotiowing the acute phase. Additional bands observed on the immunobiots in the cases of chimps 
numbers 3 and 4 were due to background binding to host bacte:'^te.ns. 

in contrast to the results obta.ned w.m sera from chimps infected w.m NANBH. the development of 
.5 antibodies to the NANBs-, moiety o( the (us.on polypeptide was not observed in 4 chimpanzees imecied 
w.tn HAV Of 3 Chimpanzees in(ectei wim H8V. The only binding .n these cases was background bindi.-g to 
the host bacte"ai proteins, which also occurr; j in thd HCV in(ected samoies. 

The Characterization o( the human sera used for me Western blots, ana me results wn.ch are shown ,n 
the photograph of the auloradiographed strips, are presented in Fig. 34. Nitrocellulose strips contaimnq 
^0 polypeptides were incubated w.m sera derived from humans at different t.mes during infections with 
NANBH (lanes 1-21). HAV (lanes 33-40). and HBV (lanes 41-49). Lanes 25 and 50 show positive controls in 
which me immunobiots were incubated wim serum from patient used in the or.gmal screening of the 
lambda-gtll library, described supra. Lanes 22-24 and 26-32 show -non-mfecied- controls m which the 
sera was from "normal" blood donors. 
:s As seen in Fig. 34. sera from nine NANBH patients, including the serum used lor screening me lambda- 
gtn library, contained antibodies to the NANB,-,- mcety of me fusion polypeptide Sera from mree 
pat-ents w.m NANBH did not contain these antibodies. It .s possible that the ant.-NANBs- -, antibod.es will 
develop at a future date in mese patients, ft is also possible mat th.s lack of reaction resulted from a 
different NANBV agent being causatr ? of me disease .n the individuals from which me non.respondinq 
30 serum was taken. ^ 

Fig. 34 also shows mat ^era from many patients infected wim HAV and HBV did not conta.n an'i- 
NANBs-.-. antibodies, and mat these antibodies were also not present m me sera from "normal" controls 
Although one HAV patient (lane 36) appears to contain anti-NANBs-, - antibodies, .t is possible that m.s 
patient had been previously infected wim HCV. since me incidence of NANBH is very high and s.nce it is 
35 often subclinical. 

These serological studies indicate that me cONA m clone S-i-i encodes epitopes which are recognized 
spec.f.cally by sera from patients and animals infected with BB-NANBV In add.iion. the cDNA does not 
appear to be derived (rom me primate genome. A hybr.dizat.on probe made trom clone 5-i-i or from done 
81 d.d not hybridize to "Southern" blots of control human and chimpanzee genomic DNA from uninfected 
^ .ndividua(s under conditions where unique, single-copy genes are detectable. These probes also did not 
hybndize to Southern blots of control bovine genomic DNA. 
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IVJA Expression of the Polypeptide Encoded in a Composite of the HCV cONAs m Clones 36. 81 and 32 



The HCV polypeptide which is encoded in the ORF which extends through clones 36. 81 and 32 was 
expressed as a fusion polypeptide with SOD. This was accomplished by inserting the composite cDNA. 
CiOO. into an expression cassene which contains me human superoxide dismutase gene, .nsenino the 
expression cassette into a yeast expression vector, and expressing the polypeptide in yeast. 

so An expression cassene containing me composite ClOO cDNA derived from clones 36. 81. and 32. was 
constructed by inserting the -I270bp EcoRI fragment into the Ecofll site of me vector pS3-56 (also called 
?S3S6). yielding the plasmid pS3-56c.oo- The construction of C1OO is described in Section IV.A.16. supra. 

The vector pS3-56. which is a pBR322 derivative, contains an expression cassette which is comprised 
of me ADH2/GAPDH hybrid yeast promoter upstream of the human superoxide dismutase gene, and a 

55 downstream GAPOH transcription terminator. A similar cassette, which contains these control elements and 
the superoxide dismutase gene has been described in Cousens et al. (1987). and in copending application 
EPO 196.056. published October 1. i986. which is commonly owned by the herein assignee. The cassene 
in PS3-56. however, differs from mat m Cousens et al. (1987) m mat me .►heterologous promsuiin gene and 
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me .mmunogiooulin h.nge are doieied. and ,n mat ihe gin. of me ?uperox,de o.smuiase .s fonoweo by 
adaptor sequence which conia(n$ an EcoRl site The sequence of ir..^ adaptor is 

5'-AAT TTG GGA ATT CCA TAA TGA G -3' 

AC CCT TAA GGT ATT ACT CAG CT 



The EcoRI sue allows the insertion of heterologous sequences which, when expiessed from a vector 
contaming- We cassette, yield polypeptides which are fused to superoxide dismuiase via an oligopeptide 
linker containing the amino acid sequence: 
■asn-leu-gly-iie*arg-. ' 



A sample of pS356 has been deposited on 29 April 1988 under the terms of tne Budapest Treaty with 
the American Type Culture Collection (ATCC). 12301 Park'iwn Dr.. Rockville. Maryland 20853. and has 
been assigned Accession No. 67683. The terms and conditions for availability and access i6 the deposit. 
2rd for maintenance of me deposit are me same as those specified in Section il.A.. for strains containing 
N.^NBV-cDNAs. This deposit is intended for convenience only, and is not required to practice the present 
invention in view of the description here. The deposited material is hereby incorporated herein by reference. 

After recombinants containing me ClOO cDNA insert in me correct orientation were isolated, me 
expression cassette containing the ClOO cONA was excised from pS3-56cioo with BamHI. and a fragment of 
-3400bp which contains the cassette was isolated and purified. This fragment was then inserted into me 
BamHI site of the yeast vector pAB24. 

Plasmid pA824. the significant features of wmch are shown in Fig. 35. is a yeast shunie vector which 
contains me complete 2 micron sequence for replication [Broach [i98l)] and pBR322 sequences, ll^also 
contains me yeast URA3 gene derived from plasmid YEp24 [Botstein et al. 1979)). and the yeast LgU^*' 
gene derived from plasmid pCvi. EPO Pub. No. ti6.20i. Plasmid pAB24 was constructed by* digesting 
YEp24 with EcoRl and religating the vector to reroove the partial 2 micron sequences. The resulting plasmid 
YEP24deltaRI. was linearized by digestion with Ctal and ligated wim the complete 2 micron plasmid which 
had been iineanzed with Clal. The resulting plasmid. pCBou. was then digested with Xbal and tne 8605 bp 
vector fragment was gel isolated. This isolated Xbal fragment was ligated with a 4460 bp Xbal fragment 
containing me LEU^** gene isolated from pCI/l : the orientation of LEU^" the gene is in me same direction as 
the URA3 gene. Insertion of the expression was in the unique BamHI site of the pBR322 sequence, thus 
interrupting the gene for bacterial resistance to tetracycline. 

The recombinant plasmid which contained me SOO-ClOO expression cassene. pAB24CiOO-3. was 
transformed into yeast strain JSC 308. as well as into omer yeast strains. The eel's were transJormed as 
described by Hinnen et al. (1978). and plated onto ura-selective plates. Single colonies were inoculated into 
leu-selective media and grown to saturation. The culture was induced to express the SOO-CiOO polypeptide 
(called C 100-3) by growm in YEP containing 1% glucose. 

Strain JSC 308 is of the genotype f^AT @. Ieu2. ura3(del) DM15 (GAP/ADR 1) integrated at me ADRl 
locus. In JSC 308. over-expression of the positive activator gene product, AORl. results in hyperderepres- 
siop (relative to an ADRi wild type control) and significantly higher yields of expressed heterologous 
P'nteins when such proteins are synthesized via an ADH2 UAS regulatory system. The construction of the - 
yeast strain JSC 308 is disclosed in copending application. U.S. Serial No. (Attorney Docket No. 2300- 
0229). filed concurrently ^ 3rewith. and which is hereby incorporated herein by reference. A sample of JSC 
308 has been deposited on 5 May 1988 with the ATCC under the conditions of the Budapest Treaty, and 
has been assigned Accession No. 20879. The terms and conditions for availability and access to the 
deposit, and for maintenance of the deposit are the same as those specified in Section \\A.. for strains 
containing HCV cDNAs. 

The complete C 100-3 fusion polypeptide encoded in pAB24C 100-3 should contain 154 amino acids of 
human SOD at the amino-terminus. 5 amino acid residues derived from the synthetic adaptor containing the 
EcoRl site. 363 amino acid residues derived from ClOO cONA. and 5 carboxy-terminal amino acids derived 
from me MS2 nucleotide sequence adjoining the HCV cONA sequence in clone 32. (See Section IV.A.7.) 
The putative amino acid sequence of the carboxy-terminus of this polypeptide, beginning at the penultimate 
Ala residue of SOD. is shown in Fig. 36; also shown is the nucleotide sequence encoding this portion of the 
Doiypeptide. 



39 



IVBj. idenM^cahon ci me Poiypepnde Encodog w,ih.n CiQG as a>) NA.jaH Assoc.aieo Am.gc-n 

The C 100-3 fusion polypepi.de expressed from piasm.d pAB24C 100-3 in yeas, sira.n JSC 3oa wao 
cliaracienced w.ih ,especi to s,ze. and me polypepi.de encoded w.m.n CiOO was .oeni.l.ed as an nanbh 
assoc.ated ani.gan By ,is immunological reactw.iy w.m serum Irom a human wiih chrome NANBH 

P^'^P^P'"^^- "^'^'^ expressed as described m Section iv.B.4.. was analyzed as loiiows 
Yeast JSC 308 cells were transformed w.th pAB24. or with pAE24Cl00-3. and were induced to express the 
heterologous piasm.d encoded polypeptide. The induced yeast cells m i ml o( culture (00„o -n. -20) were 
pellQlfid by centnlugaiiqn at 10.000 rpm lor i minute, and were lysed by vonexing them vigorously (lO . i 
mm) with 2 volumes ol solution and l volume ol glass beads (0.2 millimicron diameter). The soi-.tion 
contained 50 mM Tris-HCl. pH 8.0. 1 mM EDTA. imM phenylmethylsulphonyl fluor.de (PMSF) and 1 
m.crogram..ml pepstatin. Insoluble material in the lysate. which includes the C 100-3 polypeptide was 
collected by centrilugation (lO.OOO rpm for 5 m.nutes). and was dissolved by boH.ng lor 5 minutes .n- 
Laemmli SOS sample buffer. [See Laemmli (1970)). An amount ol polypeptides equivalent lo that m 0 3 ml 
ol the induced yeast culture was subjected to electrophoresis through 10% polyacryiam.de gels .n the 
presence of SOS according to Laemmii (1970). Proie.r standards were co-electrophoresed on the gels Gels 
conta.n.ng the exj,.essed polypeptides were either stained wnh Coomassie brilliant blue or were subjected 
to -Western- blotting as described in Section IV.B.2.. using serum from a patient w.th chrome NANBH to 
determine the .mmunolog.cal reactivity of the polypeptides expressed from pAB24 and from pAB24C 100-3 

The results are shown in Fig. 37. In Fig. 37a the polypeptides were stained with Coomass.c brilliant 
blue. The insoluble poiypeptide(s) Irom JSC 308 transformed with pA824 and from two different colonies ol 
JSC translormed w.th pAB24C 100-3 are shown in lane 1 (pAB24). and lanes 2 and 3. respectively A 
comparison ol lanes 2 and 3 with lane 1 shows the induced expression of a polypeptide corresponding to a 
molecular weight ol -54.000 daltons from JSC 308 transformed with pAB24C 100-3. wh.ch ,s not .nduceo n 
.'5 JSC 308 transformed with pAB24. This polypeptide is indicated by the arrow. 

Fig. 37B Shows the results ol the Western blots ol the insoluble polypeptides expressed m JSC 308 
translormed w.ih pAB24 (lane 1). or with pAB24Ci00-3 (lane 2). The polypeptides" expressed irom pAB24 
were not immunologically reactive with serum Irom a human with NANBH. However, as indicated by the 
arrow. JSC 308 translormed with pA3i.4C 100-3 expressed a polypeptide of -54.000 daiton molecular 
weight which did react immunologically with the human NANBH serum. The other irfmunoiogicaliy reactive 
polypeptides m lane 2 may be degradation and/or aggregation products of this -54.000 daiton polypeptide 
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IV.B.6. Purification of Fusion Polypeptide ClOO-3 

The fusion polypeptide. C 100-3. comprised ol SOD at the N-term,nus and in-frame CiOO HCV- 
polypeptide at the C-terminus was purified by differential extraction of the insoluble fraction of the extracted 
host yeast cells in which the polypeptide was expressed. 

The lusion polypepUde. ClOO-3. was expressed in yeast strain JSC 308 transformed with pAB24Ci00-3 
as descnbed m Section IV.B.4. The yeast cells were then lysed by homogenization. the insoluble material in 
the lysate was extracted at pH 12.0. and ClOO-3 in the remaining insoluble fraction was solubilired in buffer 
containing SOS. 

The yeast lysate was prepared essentially according to Nagahuma et al. (1984). A yeast cell suspension 
was prepared which was 33% cells (wv) suspended in a solution (Buffer A) containing 20 mM Tris HCl. pH 
8.0. 1 mM dithiothreitol. and 1 mM phenylmethylsullonyllluoride (PMSF). An aliquot ol the suspension (15 
ml) was mixed witn an equal volume of glass beads (0.45-0.50 mm diameter), and the mixture was vortexed 
at top speed on a Super Mixer (Lab Une Instruments. Inc.) for 8 min. The homogenate and glass beads 
were separated, and the glass beads were washed 3 times with the same volume of Buffer A as the original 
packed cells. After combining the washes and homogenate. the insoluble material in the lysate was 
obtained by centrifuging the homogenate at 7.000 x g for 15 minutes at 4'C. resuspending the pellets in 
Buffer A equal to twice the volume of original packed cells, and re-pelleting the material by centrifugation at 
7.000 X g for IS min. This washing procedure was repeated 3 times. 

The insoluble material from the lysate was extracted at pH 12.0 as follows. The pellet was suspended in 
buffer containing 0 5 M NaCl. l mM EDTA. where the suspending volume was equal to 1.8 times the of the 
original packed cells. The pH of the suspension was adjusted by adding 0.2 volumes of 0.4 M Na 
phosphate buffer. pH 12.0. After mixing, the suspension was centrifuged at 7.000 x g for 15 mm at 4*C. 
and the super natant removed. The extraction was repestefl 2 times. The extracted pellets were washed by 
suspending them in 0.5 M NaCI. i mM EDTA. using a suspension volume equal to two volumes ol the 
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onginai packed cells, loiioweci b> cemnkigauc • at 7.000 * g lor i5 «nin ai 4* C 

The C 100-3 polypepi.de m me exiractod peiio! was soiuD.i.JCd Dy i,eaiir,«r»i wm. SOS Tno pallets ,vere 
suspended .n Butter A equal lo 0 9 volumes ol the or.gmal packed cell volume, and 0 I voiurnos ol J-. SOS 
was added. Aftoi me suspension was muod. n was centnluged at 7.000 n g tor »5 mm ai 4'c Thb 
resulting pellet was extracted 3 more t.mes w.ih SDS. The resuit.ng supematanis. wh.ch contained C 100-3 
were pooled. 

This procedure purifres C 100-3 more than tO-foid Irom the msoluWe fraction of the yeast homogenate 
and the recovery ol the polypeptide is greater than 50%.. 

The purified preparation or fusion polypeptide was analyzed Dy polyacryiamide gel electrophoresis 
according to Laemmli (1970). Based upon mis analysis, me polypeptide was greater than 80% pure, and 
had an apparent molecular weight of -54.000 daltons. 



'V C Identification of RNA in Infected Individuals Which Hybridizes to HCV cDNA. 



'oentification of RNA in me Lwer of a Chimpanzee With NANBH Wh.ch Hybridizes to HCV cDNA. 

RNA from the liver of a chimpanzee which had NANBH was shown to contain a speoes of RNA which 
hyoridized to the HCV cDNA contained wimm done 8l by Northern blotting, as follows. 

RNA was isolated from a liver biopsy of the chimpanzee from which the high titer plasma was denved 
(see Section iv.A.l.) using techniques described in iviamatis et al. (1982) for the isolation of total RNA from 
mammalian cells, and for its separation into poiy A* and poly A" fractions. These RNA fractions were 
subiecteo to electrophoresis on a formaldehyde agarose gel (i% w-v). ano transferred to nitrocellulose, 
(fvtaniatis et al. (1982)). The nitrocellulose fillers were hybridized with radiolabeled HCV cONA from ctone 81 
(see Fig. 4 for the nucleotide sequence ol the insert.) To prepare me radiolabeled Qtotae, the HCVicONA 
insert isolated from clone 8i was radiolabeled with -^^P by mck translation using DNA Polymerase I 
(fvlaniatis et al. (1982)). Hybr.cization was f^r :C hours at 42* C in a solution containing 10% (w-v) Dextran 
sulphate. 50% (w.v) deionized formamide. 750 ml^ NaCI. 75 mf^ Na citrate. 20 mM NarHPOt. pH 6.5. 
0.1% SDS. 0.02% (w/v) bovine serum albumin (BSA). 0.02% (w;v) Ficoll-400. 0.02% (w/v) polyvinylpyr- 
rolidone. 100 micrograms^ml salmon sperm DNA which had been sheared by sonication and denatured, and 
10* CPM mi of the nick-transiated cDNA. probe. 

An autoradiograph of me probed filter is shown in Fig. 38. Lane i contains ^^p.iadeled restriction 
fragment markers. Lanes 2-4 contain chimpanzee liver RNA as follows; lane 2 contains 30 micrograms of 
total RNA: lane 3 contains 30 micrograms of poly A. RNA; and lane 4 contains 20 micrograms of poly A * 
RNA. As shown in Fig. 38. me liver of the chimpanzee with NANBH contains a heterogeneous population of 
relatec! poly A * RNA molec^es which hybridizes to the HCV cDNA probe, and which appears to be from 
about 5000 nucleotides to aoout 1 1 .000 nucleotides in size. This RNA. which hybridizes to the HCV cDNA. 
could represent viral genomes and/or specific transcripts of the viral genome. 

The experiment described in Section IV.C.2.. mfra. is consistent with the suggestion that HCV contains 
an RNA genome. 



'V C.2. Identification of HCV Derived RNA m Serum from Infected Individuals. 



Nucleic acids were extracted from panicles isolated from high tiler chimpanzee NANBH plasma as 
described m Section IV.A.1.. Aliquots (equivalent to i ml of onginai plasma) of the isolated nucleic acids 
were resuspended in 20 microliters 50 mM Hepes. pH 7.5. i mm EDTA and 16 micrograms/ml yeast 
soluble RNA. The samples were denatured by boiling for 5 minutes followed by imm.ediate freezing, and 
were treated with RNase A (5 microliters containing 0.1 mg/ml RNase A m 25 mM EDTA. 40 mM Hepes. pH 
7.5) or with DNase I (5 microliters containing i unit ONase I in lO ml^ MgClj. 25 mM Hepes. pH 7.5); 
control samples were incubated without enzyme. Following incubation, 230 microliters of ice-cold 2XSSC 
containing 2 microgramsmi yeast soluble RNA was added, and the samples were filtered on a nitrocel- 
lulose filter. The filters were hybndized with a cDNA probe from clone 81. which had been «P-iabeled by 
nick-transiation. Fig. 39 shows an autoradiograph of the filter. Hybridization signals were detected in the 
ONase treated and control samples (lanes 2 and i. respectively), but were not detected in the RNase 
treated sample (lane 3). Thus, since RNase A treatment destroyed the nucleic acids isolated from the 
panicles, and DNase I treatment had no effect, the evidence strongly suggests that the HCV genome !S 
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IVC3. DeiecLon ol Amphfied HCV Nuclo.c Aod Sequences der.ved irom hCV Nucie.c Acd Soou^n^c n 
5 Uver and Plasma Soec.mens Irom ChiT^^ijigT win NANBH ~ - - 

n,„!l« ^"'^ P'"*"' ^' ^'^""Paniees w.th NANBH. and .n conuo. cn.m- 

.o ri^lfi r ohgonuJeofdes were derived from the HCV cDNA sequences in clone St or 

.0 clones 36 and 37. The amplified sequences were delected by gel electrophoresis and Southern biotimg 

using as probes the appropriate cDNA oligomer with a sequence Irom the reg.on between but hoi 

including, the two primers. 

Samples of RNA conta.ning HCV sequences to be exam.neJ by the amplif.cai.on system were isolated 
TrT, T'' k'''* Chimpanzees with NANBH. and from two control chimpanzees. The isolation o. 
'S ihe RNA fraction was by the guanidinium thiocyanate procedure described in Section IV C t 

.iJjyV, °' "k^^ "^'^ amplification system were also isolated from .he 

p asmas o( tw. chimpanzees with NANBH. and from one control chimpanzee, as well as from a pool of 
plasmas from control chimpanzees. One infected chimpanzee had a CiD'ml equal to or greater than lO* 
and the other infected chimpanzee had a ClD/ml equal lo or greater than lOV 

?o The nucleic acids were extracted from the plasma as lollows. Either O.t ml or 0 01 nl of plasma was 
diluted to a final volume of 1.0 ml. with a TENB/proieinase fCSDS solution (0.05 M Tris-HCL pH 8 0 0 001 
M EDTA. 0.1 M NaCI. 1 mg..ml Proteinase K. and 0.5% SDS) containing lO micrograms-ml polyadeny.ic 
acid, and mcubated at 37 C for 60 minutes. After this proteinase K digestion, the resultant plasma fractions 
were deproieinized by extraction with TE (lO.O mM Tris-HCI. pH 8.0. i mf^ EDTA) saturated phenol The 

■5 phenol phase was separated by cenirifugation. and was reextracted with TENS containing 0.:i% SDS The 
resulting aqueous phases from each extraction were pooled, and extracted twice with an equal volume of 
pheno /chloro(ornr,..isoamyl alcohol [1:1(99:2)). and then twice with an equal vofume ol a 99:1 mixture of 
chiorolorm-isoamyi alcohol. Following phase separation by cenirifugation. the aqueous phase was brought 
to a final concentration of 0.2 M Na Acetate, and the nucleic acids were precipitated by the addition of two 

' IIT^ o' ethanol. The precipitated nude.c ac.ds were recovered by ultracentrifugation ir, a SW 4i rotor at 
38 K, for 60 minutes at 4 C. 

In addition to the above, tha high titer chimpanzee plasma and the pooled control plasma altarnativeiy 
were extracted with 50 micrograms of poly A carrier by the procedure of Chomcyzsk. and Sacchi (1987) 

c ^nLToo^.T 9va"*dinium thiocyanate extraction. RNA was recovered by centrifugation at 

5 10.000 RPfVI for 10 minutes at 4 C in an Eppendorf microfuge. 

On two occasions, prior to the synthesis of cDNA in the PCR reaction, the nucleic acids extracted from 
plasma by the proteinase lOSDS/phenol method were further purified by binding to and elution from S and 
S Elutip-R Columns. The procedure followed was according to the manufacturer's directions 

The CDNA used as a template for the PCR reaction was denved from the nucleic acids {either total 

) nucleic acids or RNA) prepared as described above. Following ethanol precipitation, the precipitated nucleic 
acids were dried, and resuspended in OEPC treated distilled water. Secondary structures in the nucleic 
acids were disrupted by heating at 65 C for 10 minutes, and the samples were immediately cooled on ice 
CDNA was synthesized using i to 3 micrograms of total chimpanzee RNA from liver, or from nucleic acids 
(or RNA) extraaed from 10 to 100 microliters of plasma. The synthesis uUlized reverse transcriptase and 
''n!.-!" * '"'c^f'tw reaction, using the protocol specified by the manufacturer. 8RL The primers for 
^Sm!^ -^ere these also utilized in the PCR reaction, described below. All reaction mixtures for 

CDNA synthesis contained 23 units of the RNAase inhibitor. RNASlNTVi (Rsher.'Promega). Following cDNA 
synthesis, the reaction mixtures were diluted with water, boiled lor 10 minutes, and quickly chilled on ice. 

The PCR reactions were performed essentially according to the manufacturer's directions (Cetus- 
. srkin-Elmer). except for the addition of 1 microgram of RNase A. The reactions were earned out in a final ' 
volume of 100 microliters. The PCR was performed for 35 cycles, utilizing a regimen of 37 'C. 72 "C. and 
94 C. 

The primers for cDNA synthesis and for the PCR reactions were derived from the HCV cDNA 
sequences :n either clone 81. clone 36. or clone 37b. (The HCV cDNA sequences of clones 81. 36. and 37b 
are shown m F>gs. 4. 5. and 10. respectrvely.) The sequences of the two i6-mer primers derived from clone 
81 were: 

S' CAA TCA TAC CTG ACA G 3' 
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and 

5' GAT /.AC CTC TGC CTG A 3 



5 



0 



Thr. sequdnce o* »he pnmer from clone 36 was; 
5' GCA TGI CAT GAT GTA T 3'. 
The sequence oMhe prrmer (rom clone 37b w;»s; 
0 S' ACA ATA CGT GTG TCA C 3*. 

In the PCR reactions, the primer pairs consisted of either the two i6-me'S derived from done 81. or the 16- 
mer from clone 36 and the te-mer from clone 37b. 

The PCR reaction prrxlucts v^'ere analyzed by separation of the products by alkaline gel electrophoresis 
5 followed by Southern blotting, and detection of the amplified HCV-cDNA sequences w.ih a ^-P-labeted 
internal oligonucleoi.d** orobe derived from a region the .iCV cONA which does not overlap the pr.mers 
The PCR reaction mixtures were extracted with phenoi/chloroform. and the nucleic acids precipitated from 
the aqueous phase with salt and ethanoi. The precipitated nucleic acids were collected by centnfugation 
and dissolved in distilled water. Aliquois of the samples were subjected to electrophoresis on 1.8% alkaline 
agarose gels. Single stranded DNA of 60. 108. . and 161 nucleotide lengths were co-elecirophoresed on the 
geis as molecular weight markers. Aiier electrophoresis, the DNAs in the gel were transferred onto Biorad 
Zeta ProbeTw paper! Prehybndization and hybridization, and wash conditions were those specified by the 
manufacturer (Biorad). 

The probes used for the hybndizatton-detectton of ampiified HCV cDNA sequences were the following 
When the pair of PCR pnmer s were derived from clone 81. the probe was an i08-mer w.ih a sequence 
corresponding to that which is located m the region between the sequences of the two primers. When the 
pair of PCR primers were derived from clones 36 and 37b. the probe was the mck-iranslated HCV cONA 
insert denved from clone 35. The pnmers are r'erivcd from nucleotides 155-170 of the done 37b insert, and 
206-268 of the clone 36 insert. The 3 -end of the HCV cDNA insert in done 35 overlaps nucleotides 1-1 86 
of ihe insert in clone 36; and the s'-end of done 35 insert overlaps nucleotides 207-269 of the insert in 
clone 37b. (Compare f^gs. 5. 8 and lO.) Thus, the cDNA insert in clone 35 spans part of the region between 
the sequences of the clone 38 and 37b derived primers, and is useful as a probe for the amplified 
sequences which include these primers. 

Analysis of the RNA from the liver specimens was according to the above procedure utilizing both sets 
of primers and probes. The RNA from the liver of the three chimpanzees with NANBH yielded positive 
hybnaization results for amplification sequences of the expected size (161 and 586 nudeotides for 8i and 
36 and 37b. respectively), while the control chimpanzees yielded negative hybridization results. The same 
results were achieved when the experiment was repeated three limes. 

Analysis of the nucleic acids and RNA from plasma was also according to the above procedure utilizing 
the primers and probe from clone 8t. The plasmas were from two chimpanzees with NANBH. from a control 
chimpanzee, and pooled plasmas from control chimpanzees. Both of the NANBH plasmas contained nucleic 
actds/RNA which yielded positive results in the PCR amplified assay, while both of the control plasmas 
yielded negative results. These results have been repeatably obtained several times. 



JD. Radioimmunoassay for Detecting HCV Antibodies in Serum from Infected individuals 



Solid phase radioimmunoassays to detect antibodies to HCV antigens were developed based upon Tsu 
and Herzenberg (1980). f^icrotiter plates (Immulon 2. Removawell strips) are coated with pi-rrfied polypep- 
tides containing HCV epitopes. The coated plates are incubated with either hum;*- serum samples 
suspected of containing antibodies to the HCV epitopes, or to appropriate controls. Juring incubation, 
antibody, if present, is immunologically bound to the solid phase antigen. After removal of the unbound 
material and washing of the microliter plates, complexes of human antibody-NANBV antigen are detected 
by incubation with '"l-labeled sheep anti-human immunoglobulin. Unbound labeled antibody is removed by 
aspiration, and the plates are washed. The radioactivity in individual wells is determined: the amount of 
oound human anti-HCV antibody is proponional to the radioactivity in the well. 
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'V D ^ Pufilication o\ Fusion Polypeptioe SOO-NANB«> t 



The fusion polypephoe SOD-NANB. expressed .n recombmani bactena as descnbed .n Sernon 
iV.B.t.. was pur.hed from the recomoinani E. coli by differential e^iracuon of the ecu e/iracis w.in Jrea 
followed by chromatography on anion ana canon exchange columns as follows. 

Thawed ceils from i liter ol culture were resuspended m lO ml of 20% (w.v) sucrose containing o Oif^ 
Tris HCI. pH a.O. and 0.4 ml ol 0.5M EDTA. pH 8.0 was added. After 5 m.nutes at O'C. the m.xiure was 
cenir.fuged at 4,000 x g lor 10 m.nutes. The resulting pellet was suspended in 10 ml of 25% (w.v) sucrose 
containing 0.05 M Tns HQ. pH 80. l mM phenylmethylsulfonylfluoride {PMSf} and 1 microgram.ml 
pepstatin A. followed by addition of 0.5 ml lysoiyme (10 mg-mi) and incubation at O' C for 10 m.nutes. After 
me addition of lO ml i% (v/v) Triton X-iOO in 0.05 M Tris HCl. pH 8,0. 1 mM EDTA. the mixture was 
incubated an additional lO min at O'C with occasional shaking. The result.ng v.scous solution was 
homogenized by passage 6 times through a sterile 20-gauge hypooerm.c needle, and centr.fuged at 13.000 
X g for 25 m.nutes. The pelleted material was suspended in 5 ml of 0.0 1 M Tr.s HCl pH 8.0. and the 
suspension centrifuged at 4.000 x g fi- (0 minutes. The pellet, which contained SOD-NANBs-» fusion 
protein, was dissolved m 5 ml of 6 M l'.o^ m 0.02 fVI -^ris HCl. pH 8.0. 1 mM dilh.othreitol (Buffer A), and 
was applied to a column of Q-Sepharr.se ^ast Flow equilibrated with Buffer A. Polypeptides were eluted 
with a linear gradient of 0.0 to 0.3 VI NaCl in Buffer A. After elution. fractions were analyzed by 
poiyacrylamide gel electrophoresis in the presence of SDS to determine ihe.r content of SOO-NANBs-.-,. 
F.-actions containing this polypeptide were pooled, and dialyzed against 6 M urea m 0.02 M sodium 
phosphate buffer. pH 6.0. 1 mM dithiothreitol (Buffer B). The dialyzed sample was applied on a column of 
S-Sepharose Fast Flow equilibrated with Buffer B. and polypeptides eluted w.th a linear gradient of 0.0 to 
0.3 M NaCI in Buffer B. The fractions were analyzed by poiyacrylamide gel electrophoresis for the presence 
of SOD-NANBs-' ana the appropriate fractions were pooled 

The final preparation of SOO-NANB*.. polypeptide was examined by electrophoresis on 
poiyacrylamide gels m :ne presence of SOS. Based upon this analysis, the preparation was more- than 80% 
pure. 



IV D.2. Purification of Fusion Polypeptide SOD-NANBg-.. 

The fusion polypeptide SOD-NANBa: expressed in recombinant bacteria as described in Section 
tv.B.2.. was purified from recombinant E. coli by differential extraction of the cell extracts with urea, followed 
by chromatography on anion and cation exchange columns utilizing the procedure described for the 
isolation of fusion polypeptide SOD-NANB5-. -i (See Section IV.D.I.). 

The final preparation of SOD-NANBs: pbiypeptide was examined by electrophoresis on poiyacrylamide 
gels in the presence of SDS. Based upon this analysis, the preparation was more than 50% pure. 



Detection of Antibodies to HCV Epitopes by Solid Phase Radioimmunoassay. 

Serum samples from 32 patients who were diagnosed as having NANBH were analyzeo by radioim- 
munoassay (RIA) to determine whether antibodies to HCV epitopes present in fusion polypeptides SOD- 
NANB$-t-^ and SOD-NANBb, were detected, 

Microtiter plates were coated with SOD-NA.NB5-1 or SOD-NANBu. which had been partially purified 
according to Sections IV.D.I. and iV.D-2.. respectively. The assays were conducted as follows. 

One hundred microliter aliquots containing 0.1 to 0.5 micrograms of SOD-NANB5 -1 -1 or SOD-NANBsr 
in 0.125 M Na berate burter. pH 8.3. 0.075 M NaCI (BBS) was added to each well of a microtiter plate 
(Dynatech Immulon 2 Removawell Strips). The plate was incubated at 4"C overnight in a humid chamber, 
after which, the protein solution was removed and the wells washed 3 times with BBS containing 0.02% 
Triton X-100 (BBST). To prevent non-specific binding, the wells were coated with bovine serum albumin 
(BSA) by addition of lOO microliters of a 5 mg/ml solution of BSA in BBS followed by incubation at room 
temperature for 1 hour: after this incubation the BSA solution was removed. The polypeptides in the coated 
wells were reacted with serum by adding 10O microliters of serum samples diluted MOO in O.OiM Na 
phosphate buffer. pH 7.2. 0.15 M NaCi (PBS) containing 10 mg/ml BSA. and incubating the serum 
containing wells for i hr at 37' C. After incubation, the serum samples were removed by aspiration, and the 
wells were washed 5 times with BBST. Anti-NANBs-i and Anti-NANBa: bound to the fusion oolypeotides 
was aeiermined by the binding of '^-l-iabeied P' (ab)? sheep anti-human igG to the coated wells. Aliquots of 
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100 m.croMers o( ihe laDeled probe cpec.f.c aci.v.ty 5-20 m.crocur.es microgram) wore aaoeo .o eacr. we. 
ar^o me piaies we,e .ncubaied at 37 c lor ^ hour, followed by removal oi excoss p.ooe by asollT.nn 
washes wuh BBST The amoun, ol rad.oaci.v.iy bound .n each well was aoierm.r.ea by cr^ 1 n 
counter which aoioc!S gamma radation. ^rji.nnng ,n 

The results ol the deiect.on of ant.-NANBs and ani-NANBe. .n .no.v.duals w„h nansh 
piesenied »n Table 1. r^MiNdn ». 



Pa Lieut 
n» fe fence 



Table 1 

Detection o£ Anti-5-l-l end Anti-8I in Sera of 
HKHB, KAV and HBV Hepatitis Patients 



S/H 



Oiaqnosis Anti-5-l>I Anti-8L 

Chronic HAWB, ivo' 0.77 4 20 

Chronic HMB, ivo l.M 5 14 

Chronic IVD 2.11 4.05 

2* 29^ AVH^, NAi*B, Sporadic 1.09 1.05 

Chronic, HANS 33.89 11.39 

Chronic, NAhB 36.22 13.67 

^- AVH, NAWB, IVD 1.90 1 54 

Chronic HAKB, Ivo 34.17 30 28 

Chronic NAXB, IVD 32.45 30.84 

* -3 1 Chronic HANB, PT* 16.09 ? 8.05 

5- -^2^ ^ote AVH KXHB, IVD 0.69 0,94 

L4te AVH HANB, IVD 0.73 0.68 

^ AVX, NANB, IVD 1.66 196 

AVH, NAJ<B, IVD 1,53 0.56 

Chronic HAWB, PT 34.40 7.55 

Chronic HAUB, PT 45-55 13.11 

Chronic HANB, PT 41.58 13.45 

Chronic NANB, PT 44.20 15.48 

9- 33^ AVH NAJ<B, IVD 3 1.92 3 1.95 

'Healed* recent 6.87 4.45 
NAWB, AVH 

9- 36 Late AVH NAHB PT 11.84 5.79 

10- 37 AVH KAUB, IVD 6.52 1.3 3 

^1- 38 Late AVH nxaZe PT 39.44 39.18 

'2. 39 Chronic MA1«B, PT 42 .22 37.54 

13. 40 AVH, NAM8, PT 1.35 1.17 

<1 Chronic NAHB? PT 0.35 0.28 
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PdC lent 



i*e I ervnce 
Number 


Diaonos is 


Anti-5-l-l 


.Anti-fi 1 


10. 42 


AVH, NA^B, IVO 


6.25 


2. 34 


IG . 4 J 


Chironic NAUB, PT 


0.74 


0.61 


17. 44 


KVH, NAWB, PT 


5.40 


1.83 


10. 4S 


Chronic, HANB, PT 


0.52 


0. 32 


19 . 46 


AVH, NAWB 


23.35 


4.15 


70 . 4 7 


AVH, Typm A 


1 .60 


1 .35 


21. 48 


AVH, Typm A 


I .30 


0 .66 


22 . 4 9 
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1.44 


0. 7< 
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R*$0lv«d R#c«nc AVM, 
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Resolved R«c«nt Avh, 
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1.04 
0.65 


26. 33 


AVH. Typ« A 
ne30iv#a R#c#nC AVH, 
Typ# A 


1.85 
1 . 02 


1 . 16 
. 0.68 


27. 54 


AVH, Typ« A 


1.35 


0 .74 


20. 55 


1*4 1« AVH, HBV 


0.58 


0.55 


29. 56 


Chronic HBV 


0.84 


1 .06 


30. 57 


Lat« AVH, HBV 


3.20 


1 .60 




cnronxc HBV 


0.47 


.0.46 


32. 59^ 


AVH, HBV 
H««l*d AVH, HBV 


0,73 
0.43 


0.60 
0.44 


33. 60^ 


AVH, HBV 
H««l«d AVH, KBV 


1.0« 
0.75 


0.92 
0.68 
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'0 . 
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AVH, HBV 
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H««lod AVH, 


HBV 
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! 7 . 
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AVH, HBV 
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.82 
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. 79 








HttAltd AVH, 


HBV 
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.53 
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. 3^ 


tS 


10 . 


65^ 


AVH, HBV 
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.95 
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.92 








H««ltd AVH, 


HBV 
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. 70 
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. 50 




jy . 




AVH, HBV 
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.03 
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.66 








H«al«d AVH, 


HBV 
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.71 
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. 39 
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Sequential serum samples available from these patients 
M VD" Intravenus Drug User 
JAVH-Acute viral hepatitis 
^ PT-Post trans fusion 
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As seen in Table i. i9 o» 32 sera from patients diagnosed as having NANBH were positive witn respect 
to antibodies directed against HCV epitopr- present in SOD-NANB? - , and SOO-NANBe^ . 

However, the serum samples which were positive were not equally immunologically reactive with SOD- 

30 NANB5-1-1 and SOD-NANBsi. Se.um samples from patient No. 1 were positive'to SOD-NANBg, but not to 
SOD-NANB5-1-1. Serum samples from patients number 10. 15, and 17 were positive to SOD-NANBc 
but not to SOO-NANBbi . Serum samples from patients No. 3. 8. n. and 12 reacted equally with both fusion 
polypeptides, whereas serum samples from patients No. 2. 4, 7. and 9 were 2-3 fold higher m me reaction 
to SOD-NANB5-.-, than to SOD-NANBst. These results suggest that NANB5.1-, and NANBs, may 

35 contain at least 3 different eoitopes: i.e.. it is possible that each polypeptide contains at least 1 unique 
epitope, and that the two polypeptides share at least 1 epitope. 
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'V D^ Specificity of the Solid Phase RIA for NANBH 



The specificity of the solid phase RIAs for NANBH was tested by using the assay on serum from 
patients infected with HAV or with HBV and on sera from control individuals. The assays utilizing partially 
purified SOONANBs-t-t and SOO-NANBit were conducted essentially as described in Section IV.D.3. 
except that the sera was from patients previously diagnosed as having HAV or HBV. or from individuals who 

45 vv-ere blood bank donors. The results for sera from HAV and HBV infected patients are presented m table i . 
Tho RIA was tested using 1 1 serum specimens from HAV infected patients, and 20 serum specimens from 
HBV infected patients. As shown in table 1. none of these sera yielded a positive immunological reaction 
with the fusion polypeptides containing BB-NANBV epitopes. 

The RIA using the NANBs-i-t antigen was used to determine immunological reactivity of serum from 

50 control individuals. Out of 230 serum samples obtained from the normal blood donor population, only 2 
yielded positive reactions in the RIA (data not shown). It is possible that the two blood donors from whom 
these serum samp:9S originated had pre/iously been exposed to HCV. 



55 IV.D.5. Reactivity of NANBs-t-i During the Course of NANBH Infection. 



The presence of anti-NANBs-i -> antibodies during the course of NANBH infection of 2 patients and 
Chimpanzees was followed using RIA as described in Section IV.D.3. In addition the RiA was usee to 
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aeiermine 'ne presence or absence ot anit-NANB' 
and HBV m'mlected chimpanzees. 

The results wh.ch are presenieo ,n TaP.e 2. show mat w.tn ch.mparu-ees and wu, hurr^anc ant. 
NANBs..-. ant.bod.es were detected toilow.ng tne onset o( the acute phase o( NANBH .nieciion" Ant.. 
NANB5-.-, ant.DOdies were not delected .n serun. samples liom ch.mpan.ees .nlecieo w.tn e.iher hav or 
HBV. Tnus ani..NANB5-.-. antibodies serve as a marker for an .nd.v.duars exposure to HCV 
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'V £- Purification of Potyclonal Serum Antibodies to NANB* . • - , 



On the basis of the specific inr^munoiogical reactivity of the SOO-NANB5-. polypeptide with the 
35 antibodies in serum samples from parents with NANBH, a method was developed to purify serum 
antibodies which react immunotog-cally with the epitope(s) in NANB5-.-1. This method utilizes affinity 
chromatography. Punfied SOO-NANBs-- polypeptide (see Section IV.D.i) was anached to an insoluble 
support: the attachment is such that the immobilized polypeptide retains its affinity for antibody to 
NANBs-!-!. Antibody in serum samples is absorbed to the matrix-bound polypeptide. After washing to 
35 remove non-specificaily bound materials and unbound materials, the bound antibody is released from the 
bound SOD-HCV polypeptide by change in pH. and'or by chaotropic reagents, for example, urea. 

Nitrocellulose membranes containing bound SOD-NANBs-i were prepared as follows. A nitrocel- 
lulose membrane. 2.1 cm Sanorius of 0.2 micron pore size, was washed for 3 minutes three times with 
BBS. SOO-NANBs-i-i was bound to the membrane by incubation of the purified preparation in BBS at 
room temperature for 2 hours: alternatively it was incubated at 4*C overnight. The solution containing 
unbound antigen was removed, and. the filter was washed three times with BBS for three minutes per wash. 
The remaining activie sites on the membrane were blocked with BSA by incubation with a 5 mg/ml BSA 
solution for 30 minutes. Excess BSA was rennoved by washing the membrane with 5 times with BBS and 3 
times with distilled water. The membrane containint^ the viral antigen and BSA was then treated with 0.05 M 
^5 glycine hydrochloride. pH 2.5. O.iO NaCI (GlyHCI) (or tS minutes, followed by 3 thrpe minute washes 
with PBS. 

Polyclonal anti-NANBs-i -1 antibodies were isolated by incubating the membranes containing the fusion 
polypeptide with serum from an individual with NANBH for 2 hours. After the incubation, the filters were 
washed 5 times with BBS. and twice with distilled water Bound antibodies were then eiuted from each filter 
5^ with 5 elutions of GlyHCI. at 3 minutes per elution. The pH of the eluates was adjusted to pH 8.0 by 
collecting each eluate in a test tube containing 2.0 M Tris HCI. pH 8.0. Recovery of the anti-NANBs-> -1 
antibody after affinity chromatography is approximately 50%. 

The nitrocellulose membranes containing the bound viral antigen can be used several limes without 
appreciable decrease in binding capacity. To reuse the membranes, after the antibodies have been eiuted 
55 the membranes are washed with BBS three times for 3 minutes. They are then stored in BBS at 4 * C. 



IV_F The Capture of hCV Particles from Infected Plasma Using Purified Human Polyclonal Anti-HCV 
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^""'^'^^'^S' "vD"d'.'Biion Of me Nucleic Aoa in the Capt .red Pan.cies lo hcv cDNA 

IV_M The Capu^ 01 HCV Parities from Iniecieg Plasma Usmg Human Polyclonal Ant-HCV Annbodies 

Protein-nucieic ac.d complexes present in mfecnous plasma ol a chimpanzee w.m NANBH were 
isolated using purified human polyclonal anii-HCV antibodies which were bound to polystyrene beads 
cr^n uS""*.' ^""•'^^'^Bs-... antibodies w,re purified from serum from a human with NANBH using the 
SOD.HCV polypeptide encoded m clone S-M. The method for purification was thai described in Section 

The purified anii-NANB,-.., antibodies were bound to polystyrene beads (1-4- diameter, specular fin 
ish. Precision Plastic Ball Co.. Chicago. Illinois) by incubating each at room temperature overnight with i ml 
of antibod.es (i m.crogram.ml in borate buffered saline, pH 8.5). Following the overnight incubation the 
beads were washed once with TBST (50 mf^ Tr.s hcI. pH 8.0. ISO ml^ NaCI. 0.05% (w, Tween 20) and 
15 then with phosphate buffered saline (PBS> containing to mg/ml BSA 

Control beads w«re prepared m an identical far-ion, t.cept that the punlied ant.-NANBs-, - antibodies 
were replaced with total human immunoglobulin. 

Capture of HCV from NANBH infected chimpanzee plasma using the anti-NANB,-. -, antibodies bound 
to beads was accomplished as follows. The plasma from a chimpanzee with NANBH used .s described in 
20 Section iv.A.i.. An aliqu-oi (1 ml) of iho NANBV infected chimpanzee plasma was incubated tor 3 hours at 
37 C with each of S beads coated with either anti-NANB,-.., antibodies, or with control immunoglobulins 
The beads were washed 3 times with TBST. 
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Hybridization of the Nucleic Acid m the. Captured Panicles to NANBV-cDNA 

The nucleic acid component released from the panicles captured with ani-NANBs - - antibodies was 
analyzed for hybridization to HCV cDNA deri- ed from clone 8i 

HCV panicles were captured from NANLh infected chimpanzee plasma, as descnbed m IV F i To 
release the nucleic acids from the particles, the washed beads were incubated for 60 min. at 37* C with 0 2 
ml per bead of a solution containing proteinase k (i mg/ml). i0 mM Tris HCI. pH 7 5 10 mM EDTA 0 25% 
(w.v) SOS. 10 micrograms/ml soluble yeast RNA. and the supernatant solution was removed The 
supernatant was extracted with phenol and chloroform, and the nucleic acids precipitated with ethanol 
overnight at -20 C. The nucleic acid precipitate was collected by centrifugalion. dried, and dissolved in 50 
mM Hepes. pH 7.5. Duplicate aliquots of the soluble nucleic acids from the samples obtained from beads 
coated with anti-NANB,..-, antibodies and with control beads containing total human immunoglobulin were 
filtered onto to nitrocellulose filters. The filters were hybridized with a «P.labeled. nick-translated probe 
made from the purified HCV cONA fragment in clone 8i. The methods for preparing the probe and for the 
hybridization are described in Section IV.C. 1 .. 

Autoradiographs of a probed filter containing the nucleic acids from particles captured by beads 
containing anti-NANB,-,-. antibodies are shown in Rg. 40. The extract obtained using the anti-NANB,--, 
antibody (A, .A,) gave clear hybridization signals relative to the control antibody extract (A,.A,) and to 
control yeast RNA (B,.8j). Standards consisting of ipg. 5pg. and lOpg of the purified, cione 81 cONA 
fragment are shown In Cl-3. respectively. 

These results demonstrate that the particles captured from NANBH plasma by anti-NANBs - - , - 
antibodies contain nucleic acids which hybridize with HCV cDNA in clone 8i. and thus provide further 
evidence that the cDN.As in these clones are derived from the etiologic agent for NANBH. 



50 IV.G. Immunological Reactivity of ClOO-3 with Purified Anti-NANBs-.-; Antibodies 

The immunological reactivity of ClOO-3 fusion polypeptide with anti-NANBs-,-, antibodies was deter- 
mined by a radioimmunoassay, in which the antigens which were bound to a solid phase were challenged 
with ounfied anti-NANBs-,., antibodies, and the antigen-antibody complex detected with '"l-iabeied sheep 
55 anti-human antibodies. The immunological reactivity of CiOO-3 polypeptide was compared with that ol SOO- 
NANBs---. antigen. 

The fusion polypeptide C 100-3 was synthesized and purified as descnbed in Section iv.B.5. and in 
Section IV.B.6.. respectively. The fusion polypeptide S0D-NAN8,-.-, was synthesized and purified as 
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aescrioed ,n Sect.on iv.B.i. and >n Section i-o i.. respectively. ,-urif,ed am.-NANB. -• am bocJies werP 
obtameo as described in Section IV.E. 

One hundred m.croliier al.qoois containing varying amounts ol punfieo C 100-3 antigen m 0 12dM Na 
borate butler. pH 8.3. 0.075M NaCI (BBS) was added to each well ol a .nicrotiier plate rOynatech immulon 2 
Removawell Strips). The plate '."as mcubaied ai 4* C overnighi in a humid chamber, alter which, the protem 
solution was removed and the wells washed 3 times with BBS containing 0.02% Triton X-iOO (BBST) To 
prevent non-specific binding, the wells were coated wiih BSA by addition o( lOO microliters o( a 5 mg,mi 
solution oJI^BSA in BBS followed by incubaticn at room temperature lor i hour, after which the excess BSa 
solution was removed. The polypeptides in the coated wells were reacted with purified ann-NANBs-, 
antibodies by adding 1 microgram antibody/well, and incubating the samples for l hr at 37' C. After 
incubation, the excess solution was removed by aspiration, and the wells were washed 5 times with BBST 
Anti-NANBs-,., bound to the fusion polypeptides was determined by the binding of -"l-labeled F'(ab)' 
sheep anti-human igG to the coated wells. Aliquots of lOO microliters of the labeled probe (specific activity 
5-20 microcuries/microgram) were added to each well, and the plates were incubated at 37'C lor i hour 
followed by removal of excess probe by aspiration, and 5 washes with BBST The amount of radioactivity 
bound in each well wa» determined by counting in a counter which detects gamma radiation. 

The results of the immunological reactivity of CiOO with purified anti-NANBs-, as compared lo that 
of NANBs-.-i with the purified antibodies are shown in Table 3. 



Table 3 



Immunological Reactivity of C 100-3 compared lo NANB5.,., by 
Radioimmunoassay 


AG(ng) 


RIA (cpm.assay) 


400 


320 


240 


160 


60 


0 


NANB5.,., 
C 100-3 


73.V 
74S0 


t732 
6985 


4954 
5920 


4050 
5593 


3051 
4096 


57 
67 



The results m Table 3 show that anti-NANBs -1 recognizes an epitope(s) m the CiOO moiety of the 
CiOO-3 polypeptide. Thus NANB5-1-1 and CiOO share a common epitope(s). The results suggest that the 
cONA sequence encoding this NANBV epitooeis) is one which is present in both clone 5-1-1 and in clone 



IV.H. Characterization of HCV 



'V H.1. Characterization of the Strandedness of the HCV Genome. 



The HCV genome was characterized with respect to its strandedness by isolating the nucleic acid 
fraction from particles captured on anti-NANBs-.-. antibody coated polystyrene beads, and deterr.;:r.;r.g 
whether the isolated nucleic acid hybridized with plus and/or minus strands of HCV cONA, 

Particles were captured from HCV infected chimpanzee plasma using polystyrene beads coated with 
immunopurified anti-NANBs -1 -i antibody as described in Section IV.F.1. The nucleic acid component of 
the particles was released using the method described in Section IV.F.2. Aliquots of the isolated genomic 
nucleic acid equivalent to 3 mis of high titer plasma were blotted onto nitrocellulose niters. As controls, 
aliquots of denatured HCV cDNA from clone 81 (2 picograms) was also blotted onto the same filters. The 
niters were probed with «P-labeled mixture of plus or mixture of minus strands of single stranded DNA 
cloned from HCV cONAs: the cDNAs were excised from clones 40b. 81. and 25c. 

The single stranded probes were obtained by excising the HCV cDNAs from clones 81. 40b. and 25c 
with EcoRl. and cloning the cONA fragments in M13 vectors. mpi8 and mpi9 (Messing (1983)). The Mi3 
clones were sequenced to determine whether they contained the plus or minus strands of DNA derived 
from the HCV cDNAs. Sequencing was by the dideoxychain termination method of Sanger et al. (1977). 

Each of a set of duplicate niters containing aliquots of the HCV genome isolated from the captured 
panicles was hybndized with either plus or minus strand probes derived from the HCV cONAs. Fig. 4i 
shows the auioradiographs obtained from orobmg ihe NAN8V genome with the mixture of probes cJer-veo 
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Irom Clones 3i. 400, and 25o Tms m,x.u,e was usea .o mcredce me sensM.v.iy ol me nybr,o.--aMon 
The samples m panel i we.e hyb..di2ea w.th mi pius slrano probe .nuiure. The samples m panel il .v, 
prooeci by nybndization w.ih ine m.nus suand p,obe mixture The composn.on ol ihc samples .n the pan, 
ol the immunobioi are piesenied if. table 4. 



Table 4 



lane 


A 


B 


1 


HCV genome 




2 






3 




cONA 81 


4 




CDNA 81 



* IS an undescribed sample. 



As seen from the re;>^ils m Fig. 41. only me minus strand ONA probe hybrrdizes w.th the .soiated HCV 
genome. This result, m combination with the result showing that the genome is sensitive to RNase and not 
DNase (See Section IV.C.2.). suggests that the genome ol NANBV is positive stranded RNA. 
20 These data, and data from other laboratories concerning the ohysicochem.cal properties ol a putative 
NAN8V(s). are Consistent with the possibility that HCV is a member of the Flaviviridae. However, the 
possibility that HCV represents a new class of viral agent has not been eliminated. 

2S 1V.H.2. Detection of Sequences tn Captured Particles 

Which When Amplified Dy 'PCR Hypndize to HCV cONA Derived from Clone 8i 

The RNA .n captured particles was obtained as described »n Section IV.H.I. The analysis for sequences 
which hybridize to the hCV cDNA derived f- -r,, clone 81 was earned out util.rmg the PCR amplification 
30 procedure, as described m Section IV.C.3. except that the hybridization probe was a kmased oligonucleotide 
derived from the clone 81 cDNA sequence. The results showed that the amplified sequences hybudized 
With the done 8i derived HCV cDNA probe. 

'^•^•3- Homology Between the Non-Structural Protein of Dengue Flavivirus (MNWWVOi) and the HCV 
J5 Polypeptides Encoded by the Combined ORF of Clones 1 4i Through 39c 



The com.'jined HCV cDNAs of clones I4i through 39c contain one continuous ORF. as shown in Fig. 26. 
The polypeptide encoded therein was analyzed for sequence homology with the region of the non-structural 
polypeptide(s) in Dengue flavivirus (MNWVDi). The analysis used the Dayhoff protein data base, and was 
performed on a computer. The results are shown in Rg. 42. where the symbol (:) indicates an exact 
homology, and the symbol (.) indicates a conservative replacement in the sequence: the dashes indicate 
spaces inserted into the sequence to achieve the greatest homologies. As seen from the figure, there is 
significant homology between the sequence encoded in the HCV cDNA. and the non-structural polypeptide- 
(s) of Dengue virus. Irt addition to the homology shown in Fig. 42. analysis ol the polypeptide segment 
encoded in a region towards the 3'-end of the cDNA also contained sequences which are homologous to 
sequences in the Dengue polymerase. Of consequence is the finding that the canonical Gly-Asp-Asp (GDD) 
sequence thought to be essential for RNA-dependent RNA polymerases is contained in the polypeptide 
encoded in HCV cONA. in a location which is consistent with that in Dengue 2 virus. (Data not shown.) 



IV.H.4. HCV-DNA is Not Detectable in NANBH Infected Tissue 

Two types of studies provide results suggesting that HCV-DNA is not detectable in tissue from an 
individual with NANBH. These results, in conjunction with those described in IV.C. and IV.H.i. and IV.H.2. 
provide evidence that HCV is not a DNA containing virus, and that its replication does not involve cDNA. 



'V.H.4.a. Southern Bloning Procedure 



52 



J lO ^ 1 0 A 1 



^ in oiCet 10 determme whemer NANBH mtecied chimpjnzee iiv^r contains aeteciaole hCV-Dna (or 
HCV-cDNA). restriction dnzyme fragments ol DN^ isolated Irom tn.s source was Southern biottea. and the 
blots probed w.ih ^-P-iabeied HCV cDNA. The results snowed that the labeled HCV cONA did not hyondiie 
to the blotted ONA from me mfecied chimpan.?ee liver, it also did not hybridize lo control blotted DNA from 
normal chimpanzee liver. In contrast, in a positive control, a labeled probe of the beta-inierferon gene 
hybridized strongly to Southern blotr ol restriction enzyme digested human placental DNA. These systems 
were designed to detect a smgie copy of the gene which was to be detected with the labeled probe. 

DNAs were isolated from the livers of two crimpanzees with NANBH. Control DNAs were isolated from 
uninfected chimpanzee liver, and from human placentas. The procedure for extracting DNA was essentially 
according lo Maniatis et al. (1982). and the DNA samples were treated with RNAse during the isolation 
procedure. 

Each ONA sample was treated with either EcoRI. Mboi. or HinCIl (n micrograms), according to the 
manufacturer's directions. The digested DNAs were electrophoresed on 1% neutral agarose gels. Southern 
blotted onto nitrocellulose, and the blotted material hybridized with the appropriate mck-transtated probe 
cDNA (3 X 10' cpm/ml of hybridization mix). The DNA from infected chimpanzee liver and normal liver were 
hybridized with J>T.iabe'<»'< HCV cDNA from clones 36 plus 8i: the DNA from human placenta was 
hybridized with ^^P^labeled DNA from the beta-interferon gene. After hybridization, the blots were washed 
under stringent conditions, i.e.. with a solution containing 0.1 x SSC. 0.1% SOS. at 65' C. 

The beta-interleron gene DNA was prepared as described by Houghton et al (i98i). 



IV H.4.b. Amplification by the PCR Technique 



In order to determine whether HCV-DNA could be detected in liver from chimpanzees wuh NANBH. 
DNA was isolated from the tissue, and subjected to the PCR amplification-detection technique using 
primers and probe polynucleotides derived from HCV cDNA from clone 81. Negative controls were DNA 
samples isolated from uninfected HepG2 tissue culture cells, and from presumably uninfected human 
placenta. Positive controls were samples of \t>i negative control DNAs to which a known relatively small 
amourii (250 molecules) of the HCV cONA insen from clone 81 was added. 

In addition, to confirm that RNA fractions isolated from the same livers of chimpanzees with NANBH 
contained sequences complementary to the HCV-cDNA probe, the PCR amplification-detection system was 
also used on the isolated RNA samples. 

In the studies, the DNAs were isolated by the procedure described in Section IV.H.4.a. and RNAs were 
extracted essentially as described by Chirgwin et al. (1981). 

Samples of DNA were isolated from 2 infected chimpanzee livers, from uninfected HepG2 cells, and 
from human placenta. One microgram of each DNA was digested with Hindlll according to the manufac- 
turer's directions. The digested samples were subjected to PCR amplification and detection for amplified 
HCV cDNA essentially as described in Section IV.C.3.. except that the reverse transcriptase step was 
omitted. The PCR primers and probe were from HCV cDNA clone 81. and are described m Section IV.C.3.. 
Prior to the amplification, lor positive controls, a one microgram sample of each DNA was "spiked" by the 
addition of 250 molecules of HCV cDNA insen isolated from clone 8i. 

In order to determine whether HCV sequences were present in RNA isolated from the livers of 
chimpanzees with NmNBH. samples containing 0.4 micrograms of total RNA were subjected to the 
amplification procedure essentially as described in Section IV.C.3.. except that the reverse transcriptase 
was omitted from some of the samples as a negative control The PCR primers and probe were from HCV 
cDNA clone 81. as described supra. 

Tne results showed that amplified sequences complementary to the HCV cDNA probe were not 
detectable in the DNAs from infected chimpanzee liver, nor were they detectable in the negative controls. In 
contrast, when the samples, including the DNA from infected chimpanzee liver, was spiked with the HCV 
cDNA prior to amplification, the clone 81 sequences were detected in all positive control samples. In 
addition, in the RNA studies, amplified HCV cONA clone 81 sequences were deieaed only when reverse 
transcriptase was used, suggesting strongly that the results were not due to a DNA contamination. 

These results show that hepatocytes from chimpanzees with NANBH contain no. or undetectable levels, 
of HCV DNA. Based upon the spiking study, if HCV DNA is present, it is at a level far below .06 copies per 
hepatocyte. In contrast, the HCV sequences m total RNA from the same liver samples was readily detected 
with the PCR technique. 
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'V ' ELISA Di^idimmai.Qfis lo^ hCV Inloctio/i Ustng hCV c 100-3 As Tcsi Anm 



All samples were assayed using the HCV ciOO-3 ELlSA This assay ut.ii.-es ihe hCV ciOO-3 ar.ugen 
(which was synthesized and punhed as doscnbeo m Section IV B 5). and a riorseraa.sn pofO:<idase fHRP) 
5 coniugate of mouse monoclonal anti-human igG. 

Plates coated with tr HCV ciOO-3 antigen were prepared as follows. A solution containing Coatmg 
buffer (50mM Na Borate. pH 9 0). 21 ml/plate. BSA (25 inicfoyrams ml). ciOO-3 (2.50 micrograms ml) was 
prepared just prior to addition to the Removeawell Immulon I plates (Oynatech Corp.). After mixing for 5 
minutes. 0.2ml/weir of the solution was added to ihe plates, they were covered and incubated for 2 hours at 
w 37 C. after Which the solution was removed by aspiration. The wells were washed once with 400 microliters 
Wash Bfffer (100 mM sodium phosphate. pH 7.4. 140 mM sodium chloride. 0.1% (VW) casein. i% (WV) 
Triton x-tOO. 0.01% (W/V) Thimerosal). After rumovai ol the wash solution. 200 microliters/ well of Postcoat 
solution (10 mfv^ sodium phosphate. pH 7.2. 150 mM sodium chloride. 0.1% (wv) casein and 2 mM 
pnenyimemyisuifonylfiuoride (PMSF)) was added, the plates were loosely covered to prevent evaporation. 
'5 and were allowed to stand at room temperature for 30 minutes. The wells were then aspirated to remove 
the solution, and lyophilized dry overnight, wiihout shelf heating. The prepared plates may be stored at 2- 
d C m sealed aluminum ^juches. 

In nrrtftf io perform the ELISA determination. 20 microliters of serum sample or control sample was 
added to a well containing 200 microliters of sample niluent (lOO mM sodium phosphate. pH 7 4. 500 mlvi 
20 ^sodium chloride, i mM EDTA. 0.1% (W^V) Casem. 0.015 (WA/) Therosal. J% (WA/) Triton X-lOO. lOO 
micrograms-ml yeast extract). The plates were sealed, and incubated at 37*C for two hours, after which the 
solution was removed by aspiration, and the wells were washed with 400 microliters of wash bufler 
(phosphate buffered saline (PBS) containing 0,05% Tween 20). The washed wells were treated with 200 
microliters of mouse anti-human igG-HRP conjugate contained m a solution of Ortho conjugate diluwnt (10 
:5 mM sodium phosphate. pH 7.2. 150 mM sodium chloride. 50% (VV) fetal bovine serum. i% (VV; heat 
ireated horse serum, l mM KjFetCN)^. 0.05% (WV) Tween 20. 0.02% (WV) Thimerosal). Treatment was 
for 1 hour at 37 C. the solution was removed by aspiration, and the wells were washed with wash buffer, 
which was also removed by aspiration. To determine the amount of bound enzyme conjugate. '200 
microliters of substrate solution ( 10 mg O-phe^/t ?nediamine dihydrochloride per 5 ml of Developer solution) 
30 was added. Developer solution contains 50 mM sodium citrate adjusted to pH 5.1 with phosphoric acid, and 
0.6 microliters'ml of 30% H^O:- The plates containing the substrate solution were incubated in the dark for 
30 minutes at room temperature, the reactions were stopped by the addition of 50 microliters- ml 4N sulfuric 
acid, and the ODs determined. 

The examples provided below show that the microliter plate screening ELISA which utilizes HCV c 100-3 
J5 antigen has a high degree of specificity, as evidenced by an initial rate of reactivity of about 1"^. with a 
repeat reactive rate of about 0.5% on random donors. The assay is capable of detecting an im- 
munoresponse in both the post acute phase of the infection, and during the chronic phase of the disease, in 
addition, the assay is capable of delecting some samples which score negative in the surrogate tests for 
NANBH: these samples come from individuals with a history of NANBH. or from donors implicated in 
40 NANBH transmission. 

In the examples described below, the following abbreviations are used: 
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JO 



ts 



20 



Al T 


Alanine ammo iransferdse 


Anti-HBc 


Annbody agamsi HBc 


Anii-HBsAg 


Antibody agamsi HBsAg 


HBc 


Hepatitis B corn antigen 


ABsAg 


Hepatitis B suHnre antigen 


IgG 


Immunoglobulin G 


tgM 


Immunoglobulin M 


lU'L 


International units- Liter 


NA 


Not available 


NT 


Not tested 


N 


Sample size 


Neg 


Negative 


00 


Optical density 


Pos 


Positive 


SCO 


Signal/cuiorf 




Standard deviation 


X 


Average or mean 


WNL 


Within normal limits 
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IV 



1.1 HCV Infection in a Population of Random Stood Donors 



00 



35 



A group of 1.056 samples (fresh sera) from random blood donors were obtajned Uom irwm Memorial 
Blood Bank. San Francisco. California. The test results obtained with these samples are summarized m a 
histogram showing the distribution of the OD values (Fig. 43). As seen in Fig. 43. 4 samples read >3. i 
sample reads between 1 and 3. 5 sample^* - ead between 0.4 and 1, and the remaining 1.046 samples read 
<0.4. with over 90% of these samples reading <0.1. 

The results on the reactive random samples are presented in Table 5. Using a cut-off value equal to the 
mean plus 5 standard deviations, ten samples out of the 1.056 (0.95%) were initially reactive. Of these, five 
samples (0.47%) repeated as reactive when they were assayed a second time using the ELISA. Table 5 
also shows the ALT and Anti-HBd status for each of the repeatedly reactive samples. Of particular interest 
IS the fact that all five repeat reactive samples were negative in both surrogate tests for NANBH. while 
scoring positive in the HCV ELlSA. 
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TABLE 5 



RESULTS ON REACTIVE RANDOM SAMPLES 


N = 1051 










X = 0.049- 








SO = 1 0.074 










* 5S0 = 0,419 ;o.400 + Negative Control) 




Samples 


Initial Reaciives 00 


Repeat Reactives 


ALT - 


Anti HBc 






00 


(lU/L) 


(00) 




0.462 


0.084 


NA 


NA 


6292 


0.569 


0.294 


NA 


NA 


6188 


0.699 


0.326 


NA 


NA 


6157 


0.735 


0.187 


NA 


NA 




0.883 


0.152 


NA 


NA 


6397 


1.567 


1.392 


30.14 


1.433 


6019 


> 3.000 


'•3.000 


46.4B 


1.057 


6651 - 


> 3.000 


> 3.000 


48.53 


1.343 


6669 


> 3.000 


> 3.000 


60.53 


1.165 


4003 


> 3.000 


3.000 


WNL"~ 


Negative 




10.' 1056 = 0.95% 


5 '1056 = 0-47% 







' Samples reading > 1.5 were not included in calculating the Mean and SO 
" ALT £ 68 lU L is above normal limits. 



Anti-HBc S 0.535 (competition assay) ts considered positive. 
WNL; Within normal limits. 



IV I.2. Chimpanzee Serum Samples 

Serum samples from eleven chimpanzees were tested with the HCV ci00-3 ELlSA. Four of these 
chimpanzees were infected with NANBH from a contaminated batch of Factor VIII (presumably Hutchinson 
strain), following an established procedure in a collaboration with Or. Oaniel Bradley at the Centers for 
Oisease Control. As controls, four other chimpanzees were infected with HAV and three with HBV. Serum 
samples were obtained at different times after infection. 

The results, which are summarized in Table 6. show documented antibody seroconversion in all 
chimpanzees infected with the Hutchinson strain of NANBH. Following the acute phase of infection (as 
evidenced by the significant rise and subsequent return to normal of ALT levels), antibodies to HCV c 100-3 
became detectable in the sera of the 4/4 NANBH infected chimpanzees. These samples had previously 
been shown, as discussed in Section IV.B.3.. to be positive by a Western analysis, and an RIA. In contrast, 
none of the control chimpanzees which had been infected with HAV or HBV showed evidence of reactivity 
in the ELISA. 



56 



Ch- U 318 21b Ai 



P .'^JX 6 



10 



NCCILIVC CONTIIOL 

PosiTivc Control 
Cuiof f 



J2L 



0. 001 

1. son 

0.401 



IvOCULAT ION 
0*TC 



BlCCD 



^5 



Chi mo 1 



Chi mo 2 



20 



?5 



30 



Chimp 3 



Chimo 4 



Chi no 5 



35 



40 



Qhimo 6 



Chimo 7 



-0.007 
0.003 
>3.000 
>3.U00 



-0.005 
-U.OOS 
0.915 
>5.000 

0.017 
O.OOG 
l.OtO 



45 



-0.006 
0.005 
0.523 
l.57'» 

-0.006 
0.001 
0.005 
0.006 



-0-005 
0.001 

-0.00< 
0.290 

-0.008 
-0.000 
-0.006 
0.005 



0.00 
0.01 

>),nz 



o.co 

0.00 
2.3C 
>7.18 

O.OI 
0.00 
0.01 
2.52 

0.00 
O.ji 
1. 31 
5.93 

0.00 
0.00 
0.01 
0.01 



0.00 
0.00 
0.00 
0.72 

0.00 
0.00 
0.00 
0.01 



05/24/84 



06/07/84 



05/24/84 



03/11/85 



03/11/85 



11/21/80 



05/25/82 



05/25/52 



08/07/81 


71 


OS/18/84 


19 


10/71/80 






05/31/84 


— 
5 


06/28/84 


52 


08/20/84 


13 


10/24/84 


— — — 


03/14/85 


8 


04/26/85 


205 


05/06/85 


14 


08/20/85 


c 

D 


03/11/85 


11 


05/09/85 


132 


06/06/85 




08/01/85 




11/21/80 


4 


12/16/SO 


147 


12/30/80 


18 


07/29 - OS/21/81 


5 


05/ J 7/82 




OG/IO/82 


lOG 


07/06/82 


to 


10/01/82 




05/25/82 


7 


06/17/82 


83 


09/16/82 


5 


10/09/82 





Hm 



nm 



HAV 



KAV 



HAV 
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55 



57 



QilXPANZIX SE3m SAMPLES 
(Cont'd) 

Ihucuiaiion Bicco ALT 



IIAV 



MBV 



^^-"^P Q -o.oo; 0.00 ii/2i/:o 11/21/80 is 

0.000 0.00 l2/16/ro IJO 

0.00^ 0.01 02/U5/81 3 

0.000 0.00 06/03 - Of/10/81 «.5 

Chimp 9 — — Q7m/m — — 

0.019 0.05 08/72 - 10/10/79 

OJ/ll/81 57 

0.015 0.01 07/01 - 08/05/81 9 

0.008 0.02 10/01/81 6 

Chimp 10 ... ... 05/12/82 

0.011 0.03 01/21 - 05/12/82 9 

0,015 0.01 09/01 - 09/08/82 126 

0.008 0.07 12/02/82 9 

0.010 0.02 01/06/83 15 

Chirr.p 1 1 — — 05/12/82 — — HOV 

0.000 0.00 Ol/OC - 05/12/82 ll 

00/23/82 100 

-0,005 0.00 06/09 - 07/07/82 

-0.003* 0.00 10/78/82 9 

-0.003 0.00 12/20/82 10 



llOV 



'-3. Panel Proven Infectious Sera from Chronic Human NANBH Carriers 



A coded panel consisted of 22 unique samples, each one in duplicate, for a total of 44 samoles. The 
samples were from proven infectious sera from chronic NANBH carriers, infectious sera from implicated 
donors, and infectious sera from acute phase NANBH patients. In addition, the samples were from highly 
pedigreed negative, controls, and other disease controls. This panel was provided by Or, H. Alter of the 
Department of Health and Human Services. National Institutes of Health. Bethesda, f^aryland. The pan«i 
was constructed by Dr. Alter several years ago. and has been used by Dr. Alter as a qualifying panel for 
putative NANBH assays. 

The entire panel was assayed twice with the ELISA assay, and the results were sent to Or. Alter to be 
scored. The results of the scoring are shewn in Table 7. Although the Table reports the results of only one 
set of duplicates, the same values were obtained for each of the duplicate samples. 

As shov.-n in Table 7. 6 sera which were proven infectious in a chimpanzee model were strongly 
positive. The seventh infectious serum corresponded to a sample for an acute NANBH case, and was not 
reactive in this ELISA. A sample from an implicated donor with both normal ALT levels and equivocal 
results in the chimpanzee studies was non-reactive in the assay. Three other serial samples from one 
individual with acute NANBH were also non-reactive. All samples coming from the highly pedigreed 
negative controls, obtained from donors who had at least i0 blood donations without hepatitis implication, 
were non-reactive in the ELISA. Rnally. four of the sampleis tested had previously scored as positive in 
putative NANBH assays developed by others, but these assays were not confirmable. These four samples 
scored negatively with the HCV ELISA. 
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TABLE 7 





H. ALTER'S PANEL l 


5 


PANEL 


1ST 
RESULT 


2N0 
RESULT 


- 


1) PROVEN INFECTIOUS BY CHIMPANZEE 
TRANSMISSION 


10 


A. CHRONIC NANS: POST-TX 




JF 
EB 
PG 


+ 






B. IMPLICATEO DONORS WITH ELEVATED 
ALT 


:c 


BC 

JJ 

BB 




•4' 




C. ACUTE NANS: POST-TX 




WH 








2) EQUIVOCALLY INFECTIOUS BY 
CHIMPANZEE TRANSMISSION 




A. IMPLICATED DONOR WIH NORMAL ALT 




CC 




30 


3) ACUTE NANB: F'OST-TX 




JL WEEK 1 
JL WEEK 2 
JL WEEK 3 






3S 


4) DISEASE CONTROLS 




A. PRIMARY BILIARY CIRRHOSIS 




EK 






40 


B. ALCOHOLIC HEPATITIS IN RECOVERY 




HB 








5) PEDIGREED NEGATIVE CONTROLS 




DH 
DC 
LV 
HL 
AH 








6) POTENTIAL NANB "ANTIGENS" 


50 p 


JS-80-OIT-0 (ISHIDA) 
ASTERIX (TREPO) 
ZURTZ (ARNOLD) 
BECASSOINE (TREPO) 
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'V '^ Panfti 2; Donor/Recipieni NANBH 
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The codeo panel consisied ol lO unequivocal aonof-fecipieni cases transfusion associaiea Nanbh 
w.ih a loidi ol 188 samples. Eacn case cor sisted ol samples ol some or all the donors to the reop.ent. ana 
ol serial sample? (drawn 3. 6. ana \2 months alter iransluston) from the reciprent. Also included was a p'e- 
Dieed. drawn Irom the reciptont belore iranslusion. The coded panel was provided by Dr. H. Aiier. from tne 
NiH. and the results wpre sent to him lor sconng. 

The results, which are summarised m Table 8. show that the ELlSA detected antibody seroconversion 
in 9 oMO cases or iranslusion associated NANBH. Samples from case 4 (where no seroconversion was 
detected), consistently reacted poorly in the ELiSA. Two of the 10 recipient samples were reactive at 3 
months^posi translusion. At six months, & recipient samples were reactive: and at twelve months, with the 
exception ol case 4. all samples were reactive. In addition, at least one antibody positive donor was found m 
7 out of the 10 cases, with case lO having two positive donors. Also, in case lO. the recipient's pre-bieed 
was positive lor HCV antibodies. The one month bleed from this recipient dropped lo borderline reactive 
levols. while It was elevated to positive at 4 and 10 month bleeds. Generally, a S'CO ol 0.4 is considered 
positive. Thus, this case may represent a prior inleciion of the individual with HCV. 

The ALT and HBc status lor alt the reactive, i.e.. positive, samples are summarised m Table 9. As seen 
in the table. 1.8 donor samples was negative for the rjrrogate markers and reactive m the HCV antibody 
ELISA. On the otht?r hand, the recipient samples (followed up to 12 months afier transfusion) had either 
elevated ALT. positive Anti-HBc. or both. 



TABLE 8 



DONOR'RECIPIENT NANB PANEL 



H. ALTER DONOR.RECIPIENT NANB PANEL 



CASE 


DONOR 


RECIPIENT . 
PREBLEED 


POST-TX , 


00 


S'CO 


OD 


SCO 


3 MONTHS 


6 MONTHS 


12 MONTHS 


OD 


SCO 


00 


S/CO 


OD 


SCO 


1. 






032 


0.07 


.112 


0.25 


> 3.000 


>6.96 


> 3.000 


>6.96 


2. 






.059 


0.14 


.050 


0-12 


1.681 


3.90 


> 3.000 


>6.96 


3. 


.403 


0-94 


.049 


O.n 


.057 


0.13 


> 3.000 


>6.96 


> 3.000 


>6.96 


4. 






.065 


0.15 


.073 


0.17 


.067 


0.16 


.217 


0.50 


5. 


> 3.000 


>6.96 


.034 


0.08 


.096 


0.22 


> 3.000 


>6.96 


> 3.000 


>6.96 


6. 


> 3.000 


>6.36 


.056 


0.13 


1.475 


3.44 


> 3.000 


>6.96 


> 3.000 


>6.96 


7. 


> 3.000 


>6.96 


.034 


0.08 


-056 


0-13 


> 3.000 


>6.96 


>^.000 


>6.96 


8. 


> 3.000 


>5.96 


.061 


0.14 


.078 


0.18 


2.262 


5.28 


> 3.000 


>6.96 


9. 


> 3.000 


>5.96 


.080 


0.19 


.127 


0.30 


.055 


0.13 


> 3.000 


>6.96 


10. 


> 3.000 


>6.96 


> 3.000 


>6.96 


.3ir 


0.74 


> 3.000- 


>6.96 


> 3.000- 


>6.96 




> 3-000 


>6.96 



















• 1 MONTH. 
" 4 MONTHS. 
10 MONTHS 
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Table 9 



20 



ALT AND HSC STATUS FOR REACTIVE SAMPLES IN 




H. ALTER PANEL 1 




Samples 


Anti-ALT* 


HBc- 


Donors 




Case 3 




Normal 


Negative 


Case S 




Elevated 


Positive 


Case 6 




Elevated 


Positive 


Case 7 




Not available 


Negative 


Case 8 




Normal 


Positive 


Case 9 




Elevated 


Not available 


Case to 




Normal 


Positive 


C;ise 10 




1 Normia' 


1 Positive 


Reciptents 


Case 1 


6 mo 


Elevatbd 


Positive 


12 mo 


Elevated 


Not tested 




Case 2 


6 mo 


Elevated 


Negative 


12 mo 


Elevated 


Not tested 


Case 3 


6 mo 


Normal 


Not tested"" 


12 mo 


Elevated 


Not tested" 




Case 5 


6 mo 


Elevated 


Not tested 


12 mo 


Elevated 


Not tested 




Case 6 


3 mo 


Elevated 


Negative 


6 mo 


Elevated 


Negative 




12 mo 


Elevated 


Not tested 




Case 7 


6 mn 


Elevated 


Negative 


12 mo 


Elevated 


Negative 




Case 8 


6 mo 


Normal 


Positive 


12 mo 


Elevated 


Not tested 




Case 9 


12 mo 


Elevaidd 


Not tested 


Case 10 


4 mo 


Elevated 


Not tested 


10 mo 


Elevated 


Not tested 





' ALT ^45 lU/L is above normal limits. 
" Anti^HBc S50% (competition assay) is considered positive. 
P-'ebleed and 3 mo samples were negative for HBc. 



'V ''5- Determination of HCV Infection in High Risk Group Samples 



Samples from high risk groups were monitored using the ELISA to determine reactivity to HCV c 100-3 
antigen. These samples were obtained from Dr. Gary Tegtmeier. Community Blood Bank. Kansas City. The 
results are summarized in Table 10. 

As shown in the table, the samples with the highest reactivity are obtained from hemophiliacs (76%). In 
addition, samples from individuals with elevated ALT and positive for Anti-HBc. scored 51% reactive, a 
value which is consistent with the value expected from clinical data and NANBH prevalence in this group. 
The incidence of antibody to HCV was also higher in blood donors with elevated ALT alone, blood donors 
positive for antibodies to Hepatitis B core alone, and in blood donors rejected for reasons other than high 
ALT or anti-core antibody when compared to random volunteer donors. 
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TABLE »0 



NANBH HIGH RISK GROUP SAMPLES 



Group 


IN 


Oistridution 


% Reactive 


N 


OD 




-Elevated ALT 


1 c 

35 


3 


> 3.000 


1 1 .4% 


1 


0.728 








Anii-HRr 


24 


5 


> 3.000 


20.8% 


Elevated ALT Ann-HRr 


33 


12 


> 3.000 


51.5% 




TOO 










2.324 










0.939 










0.951 




















25 


5 


> 3.000 


20.0% 


DonAfC with Hictnru rt' WA^fatit.o 


1 50 


19 


> 3.000 


14.7*»'o 




0.837 










U.7 I 4 










0.469 








HasmnnhiliArc 


50 


31 


> 3.000 


76.0% 




2.568 










2.483 










2.000 










1.979 










1.495 










1.209 










0.819 









^ Comparative Studies Using Anti-lgG or Anii-lgM Monoclonal Antibodies, or Polyclonal Antibodies as a 
Second Antibody in the HCV ciOO-3 EUSA 

The sensitivity of the ELISA deternr»ination which uses the anti-igG monoclonal conjugate was compared 
to that Obtained by using either an anti-IgM monoclonal conjugate, or by replacing both with a polyclonal 
antiserum reponed to be both heavy and light chain specified. The following studies were penormed. 

'V l-6.a. Serial Samples from Seroconveners 

Serial samples from three cases o( NANB seroconveners were studied in the HCV ciOO-3 ELiSA assay 
using in the enzyme conjugate either the anti-IgG monoclonal alone, or in combination with an antl-igM 
monoclonal, or using a polyclonal antiserum. The samples were provided by Dr. Cladd Stevens. N.Y. Biood 
Center. N.Y.C.. N.Y.. The sample histories are shown in Table 11. 

The results obtained using an anti-IgG monoclonal antibody-enzyme conjugate are shown in Table 12 
The data shows that strong react; "ty is initially detected in samples 1-4. 2-8. and 3-5. of cases 1 2 and 3 
respectively. 

The results Obtained using a combination of an anti-igG monoclonal conjugate and an anti-lg(^ 
con,ugate are shown in Table 13. Three different ratios of anti-IgG to anti-IgM were tested: the i 10 000 
dilution of anti-igG was constant throughout. Dilutions tested for the anti-lgfvj monoclonal conjugate were 
1:30.000. 1:60.000. and 1:120,000. The data shows that, in agreement with the studies with anti-IgG alone 
initial strong reacuvity is detected in samples 1-4. 2-8. and 3-5. 

The results. obtained with the ELISA using anti-IgG monoclonal conjugate (1:10.000 dilution) or Tago 
polyclonal conjugate (1:80.000 dilution), or Jackson polyclonal conjugate (1:30.000 dilution) are shown ,n 
Table 14. Tne data indicates that initial strong reactivity is detected in samples 1-4. 2-8. and 3-5 using all 
three configurations: the Tago polyclonal antibodies yielded the lowest signals. 
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Tho .esuiis p.cseniocl aoove show ,hai all th.ea conh.ji-aiions deiect ,aaci,ve samples .v me same iim^ 
alie. ine acuie pnase oi ihe o.^aase (as ev.dent >d by ih« ALT elevahon,. Moreove,. the -esuits md.caie ma, 
tne sons.t,v„y ol ,no HCV clOOO ELISA us.ng ani-igG monociona.-en.yme conjugate -s equal ,o or .^et.e 
!han that obiameJ using the other tested configurations lor the enzyme con,ugaie. 

TABLE 11 

DESCRIPnON or SAMPLES FRQM_CLADD STEVENS PANEL 



JO 



J5 



0<t t 



HltAi 



20 



1-1 
1-2 

1-4 
1-5 



i/3/ai 
t/2/8: 

10/7/«i 
12/13/11 



1,0 


91.7 


12.9 


40.0 


-l.O 


1.0 


121.0 


13.1 


274 .0 


1.4 


1.0 


M.O 


23.1 


261.0 


0.9 


1.0 


47.3 


33. 8 


75.0 


0.9 


1.0 


50.3 


27.6 


^71.0 


1.0 



30 



35 



2-1 


10/19/81 


1.0 


1.0 


2-2 


11/17/11 


1.0 


0.8 


2-3 
2'A 
2-5 


12/02/81 


1.0 


1.2 


12/14/81 


1.0 


0.9 


12/23/81 


1.0 


0.3 


.:-6 


1/20/82 


1.0 


0.8 


2-7 


2/13/12 


1.0 


0.8 


2-8 


3/17/82 


1.0 


0.9 


2-9 


6/21/82 


1.0 


0.9 


2-10 


3/I9/I2 


1.0 


0.9 


2-11 


4/U/82 


1.0 


0.8 



116.2 
89.5 
78.3 
90.6 
93.6 
92.9 
86.7 
69.8 
67.1 
74.8 
82.9 



17.0 
46.0 
63.0 
152.0 
(24.0 
66.0 
70.0 
24.0 
33.0 
95.0 
37.0 



-1.0 
1. 1 

1.4 
1.4 
1.7 
1.5 
1.3 
-1.0 
1.5 
1.6 
-1.0 



40 



C4f 3 



50 



3-1 

3-2 
3-3 

3-4 
3-5 
3-6 
3-7 
3-8 
3-9 



4/7/81 

3/12/81 

3/30/81 

4/9/tl 

7/4/11 

•/io;:i 

9/S/81 

10/14/81 

11/11/81 



1.0 


1.2 


88.4 


13.0 


-1.0 


1.0 


1.1 


126.2 


236.0 


0.4 


1.0 


0.7 


99.9 


471.0 


0.2 


1.0 


1*2 


110. s 


315.0 


0.4 


1.0 


1.1 


89.9 


273.0 


0.4 


i.O 


1.0 


118.2 


158.0 


0.4 


1.0 


1.0 


112.3 


84.0 


0.3 


1.0 


0.9 


102.3 


180.0 


0.3 


1.0 


1.0 


84.6 


154.0 


0.3 



55 



63 



TABLE 12 



ELISA RESULTS OBTAINED USING AN ANTl-lgG 


MONOCLONAL CONJUGATE 




SAMPLE 


DATE 


ALT 


OD 


S'CO 


NEG CONTROL 






.076 




CUTOFF 






.476 




PC (1:120) 






1.390 




CASE#l 




1-1 


08/05/81 


40.0 


.178 


.37 


1-2 


09/02/81 


274.0 


.154 


.32 


1-3 


10/07/81 


261.0 


.129 


.27 


1-4 


n '19/81 


•75.0 


.937 


1.97 


'■5 


12/15/81 


7-..0 


> 3.000 


>6.30 


CASE n 


2-1 


I0''l9/8i 


17.0 


.058 


0.12 


2-2 


1 VI 7/81 


46.0 


.050 


0.11 


2-3 


12/02/81 


63.0 


.047 


0.10 


2-4 


12M4/81 


152,0 


.059 


0.12 


2-5 


12/23/81 


624.0 


.070 


0.15 


2-6 


01-20/82 


66.0 


051 


0.11 


2-7 


02-1 5 '82 


70.0 


.139 


0.29 


2-8 


03/17/82 


24.0 


1.867 


3.92 


2-9 


04/21/82 


53.0 


> 3.000 


>6.30 . 


2-10 


0'-v'l9'82 


95.0 


> 3.000 


>6.30 


2-11 


00-14/82 


37.0 


> 3.000 


>6-30 


CASE n 


3-1 


04/07/81 


13.0 


.090 


.19 


3-2 


05/U'8l 


236.0 


.064 


.13 


3-3 


0S30/81 


471.0 


.079 


.17 


3-4 


06/09/81 


315.0 


.211 


.44 


3-5 


07/06/81 


273.0 


1.707 


3.59 


3-6 


Oa'10/81 


158.0 


> 3.000 


>6.30 


3-7 


09/08/81 


84.0 


> 3.000 


>6.30 


3-8 


10/14/81 


180.0 


> 3.000 


>6.30 


3-9 


11/11/81 


154.0 


>3.000 


>6.30 
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ELISA RE 


SULTS oa 


TAINE 


D USING ANTl-IgG and ANTI-IgM MONOCLONAL CONJUGATE 


SAMPLE 


Ha tf 


Al T 


NANB EUSAS 








MONOCLONALS 


MONOCLONALS 


MONOCLONALS IGG 








IGG 1:10K iGM 


IGG 1:10K IGM 


1:10K IGM 1:120K 








1:30K 




1:60K 












OD 


S/CO 


OD 


SCO 


OD 


S/CO 


NEG CONTROL 






.100 




.080 




.079 




CUTOFF 
















PC (1:128) 






1.083 




1.328 




1.197 




CASE »\ 








1-1 


08 05/81 


40 


.173 




.162 




.070 




1-2 


09''02'81 


274 


.194 




.141 




.079 




1-3 


10.'07/81 


261 


.162 




.129 




.063 




1-4 


I M 9-81 


75 


.812 




.85 




.709 




1-5 


12' 15/81 


71 


>3.00 




>3.00 




>3.00 




Case *2 


2-1 


10 19. 81 


17 


442 




.045 




.085 




2-2 


11 17.81 


46 


102 




.029 




.030 




2-3 


12 02-81 


63 


.059 




.036 




.027 




2-4 


12 14/81 


152 


.065 




.041 




025 




2-5 


1^•23.'81 


624 


.082 




.033 




•032 




2-6 


01. '20/82 


66 


.102 




.042 




.027 




2-7 


02'15/82 


70 


.188 




.068 




.096 




2-8 


03/17/82 


24 


1.728 




1.568 




1.541 




2-9 


04/21.82 


53 


>3.C0 




2.443 




>3-00 




2-10 


05/19/82 


95 


>3.00 




>3.00 




>3.00 




Ml 


06/14/82 


37 


>3.00 




>3-00 




>3.00 




CASE n 






3-1 


04/07/81 


13 


.193 




.076 




.049 • 




3-2 


05/12/81 


236 


.201 




.051 




.038 




3-3 


OS30/81 


471 


.132 




.067 




.052 




3-4 


06'09/8l 


315 


.175 




.155 




.140 




3-5 


07/06/81 


273 


1.335 




1.238 




1.260 




3-6 


08/10/81 


158 


>3.00 




>3.00 




>3.00 




3-7 


09/08/81 


84 


>3.00 




>3.00 




>3-00 




3-8 


10/14/81 


180 


>3.00 




>3.00 




>3.00 




3-9 


11/11/81 


154 


>3.00 




>3.00 




>3,00 
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65 



TABLE 14 
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cLISA ncSL 


IL T b Ot 


ITAINEO USING POLYCLONAL CONJUGATES 


SAMPLE 


DATE 


ALT 


NANB ELISAS 








MONOCLONAL I:lOH 


TAGO 1 :80K 


JACKSON 1:80K 








OD 


S'CO 


00 


SCO 


00 


SCO 


NEG CONJflOL 






.076 




.045 




.154 




CUTOFF 






.476 




.545 




.654 




PC (1:128) 






1.390 




.727 
















M 


08 0581 


40 


.178 


.37 


.067 


.12 


.153 


23 


1-2 


09/02.81 


274 


.154 


.32 


.097 


18 


225 


.34 


1-3 


:'^'07.'8l 


261 


.129 


.2/ 


.026 


.05 


.167 


26 


1-4 


1 119. 81 


75 


.937 


1-97 


.324 


.60 


.793 


1 21 


1-5 


12/15.B1 


71 


>3.00 


>6-30 


1.778 


3.27 


>3.00 


>4.59 


CASE #2 


• 




2-1 


lO'iaSl 


17 


.058 


12 


023 


.04 


.052 


.08 


2-2 


n 17-81 


46 


.050 


.n 


018 


.03 


.058 


.09 


2-3 


12 02 81 


63 


.047 


.10 


020 


.04 


060 


.09 


2-4 


12 14.81 


152 


.059 


12 


.025 


05 


.Ut)4 


.08 


2-5 


12/23. 81 


624 


.070 


.15 


.026 


.05 


.074 


1 \ 


2-6 


01. 20. 82 


66 


.051 


11 


.018 


.03 


058 


09 


2-7 


02 IS 82 


70 


.139 


.29 


.037 


07 


.1 46 




2-8 


03'17.'82 


24 


' 367 


3.92 


-355 


.65 


1-429 


2 19 


2-9 


04/21. 82 


53 


>3.0^ 


>6.30 


.748 


1.37 


>3.00 


>4.59 


2-10 


05 '19/82 


95 


>3.00 


>6.30 


1.025 


1.88 


>3.00 


>4.59 


2-n 


06.14.82 


37 


>3.00 


>6.30 


.917 


1-68 


>3.00 


>4.59 


Case <'3 










3-1 


04/07/81 


13 


.090 


.19 1 


-049 


.09 


.138 


21 


3-2 


05/1231 


236 


.064 


.13 


•04O 


.07 


.094 


.14 


3-3 


05.-30/81 


471 


.079 


.17 


.045 


.08 


.144 


.22 


3-4 


06/09/81 


315 


.211 


.44 


.085 


.16 


275 


-42 


3*5 


07'06/81 


273 


1.707 


3.59 


.272 


.50 


1.773 


2.71 


3-6 


oe'io/8i 


158 


>3.00 


>6.30 


1.347 


2.47 


>3.00 


>4.59 


3-7 


09/0^81 


84 


>3-00 


>6.30 


2.294 


4.21 


>3.00 


>4.59 


3-8 


10/14/81 


180 


>3.00 


>6.30 


>3.00 


>5.50 


>3.00 


>4.59 


3-9 


11/11/81 


154 


>3.00 


>6.30 


>3.00 


>5.50 


>3.00 


>4.S9 



'V ' 5-b. Samples from Random Blood Donors 

Samples from random blood donors (See Section iv.l.l.) were screened for HCV infection using the 
HCV c 100-3 ELISA. in which the antibody-enzyme conjugate was either an anti-IgG monoclonal conjugate, 
or a polyclonal conjugate. The total number of samples screened were 1077 and 1056. for the polyclonal 
conjugate and the monoclonal conjugate, respectively. A summary of the results of the screening is shown 
in Table 15. and the sample distributions are shown in the histogram in Fig. 44. 

The calculation of the average and standard deviation was perlormed excluding samples that gave a 
s.gnal over K5. i.e.. 1073 OD values were used for the calculations utilizing the polyclonal conjugate, and 
1051 for the anti-IgG monoclonal conjugate. As seen in Table 15. when the polyclonal conjugate was used. 
:he average was shifted from 0.0493 to 0.0931. and the standard deviation was increased from 0.074 to 
0.0933. Moreover, the results also show that if the criteria of x * 5SD is employed to define the assay 
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cuioit ine polyciorai-en;yme con,ugaie configuration .n rne ELlSA ,equ.res 3 h.gi.d. cuioll vaiu^ m.c 
.ndicaies a reduced assay speciLciy as cor. oared to the monoclonal system in add.i.on. as dep.cied m me 
h-siogram .n F,g. 44. a greaier separation of results between negative and positive distributions ocrurs wnen 
random blood donors are screened m an ELlSA using the ant.-lgG monoclonal coniugate as compared to 
the assay using a comrrercial polyclonal label. 





TABLE 15 




COMPARISON OF TWO ELlSA CONFIGURATIONS IN 
TESTING SAMPLES FROM RANDOM BLOOD DONORS 




CONJUGATE 


POLYCLONAL 
(Jackson) 


ANTMgG 
MONOCLONAL 


lb 


Number of samples 
Average (x) 

Standard deviation (SO) 
5 SD 

CUT-OFF (5 SO * X) 


1073 
0.0931 
0.U933 
0.4666 
0,5596 


1051 

0.04926 
0.07427 
0.3714 
0.4206 



20 
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IVJ_ Detection of HCV Seroconversion in NANBH Patients from a Vanety of Geographical Locations 

Sera from patients who were suspected to have NANBH based upon elevated ALT levels, and wno 
were negative in HAV and HBV tests were screened using the RIA essentially as described m Section IV.D.. 
except that the HCV CiOO-3 antigen was used as the screening antigen m the microtiier plates. As seen 
from the results nr«sented m Table iS. the RIA delected positive samples m a h.gh percentage of tne 
cases 

Table 16 
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40 



Seroconversion Frequencies for Anti-c 100-3 


Among NANBH Patients in Different Countries 


Country 


The 


Italy 


Japan 




Netherlands 






No. Examined 


5 


36 


26 


No. Positive 


3 


29 


19 


% Positive 


60 


80 


73 



^5 IV.K. Detection of HCV Seroconversion in Patients with "Community Acquired" NANBH 



Sera which was obtained from 100 patients with NANBH. for whom there was no obvious transmission 
route (i.e.. no transfusions, i.v. drug use. promiscuity, etc. were identified as risk factors), was provided by 
Dr. M. Alter of the Center for Disease Control, and Dr. J. Dienstag of Harvard University. These samples 
were screened using an RIA essentially as described in Section IV.D.. except that the HCV cl00-3 antigen 
was used as the screening ant.gen anached to the microliter plates. The results showed that of the lOO 
serum samples. 55 contained antibodies that reacted immunologically with the HCV c 100-3 antigen. 

The results describe:* above suggest that "Community Acquired" NANBH is also caused by HCV. 
Moreover, since it has been demonstrated herein that HCV is related to Fiaviviruses. most of which are 
transmitted by arthropods, it is suggestive that HCV transmission in the "Community Acquired" cases also 
results from arthropod transmission. 



0/ 



iv_L_ Ccmoanson oT inooence o( HCV Annpog.fcs nnd Surrogate MafKers Donors impi.cateo .n NANBH 
Transmission • ~* " 



A prospeci.ve siuOy was earned oui to aeierrr^.ne whether recipients ol Diood (rom suspected NAN8H 

b pos.ifve donors, who developed NANBH. seroconverted to anti-HCv-anLPody positive. The blood donors 
were tested lor me surrogate marker abnormalities which are currently used as markers for NANBH 
infection. ».e.. elevated ALT levels, and the presence of anti-core antibody. In addition, the donors were also 
tested lor the presence of anti-HCV antibodies. The detern^mation of the presence of anti-HCV antibodies 
was determined using a radioimmunoassay as described in Section IV.K. The results of the study are 

JO presenter m Table 17. which shows: the patient number (column i); the presence of ani.-HCV annbod.es m 
patient serum (column 2); the number of donations received by the patient, with each donation being from a 
different donor (column 3); the presence of anti-HCV antibodies in donor serum (column 4); and the 
surrogate abnormality of the donor (column S) (NT or - means not tested) (ALT is elevated transaminase, 
and ANTI-HBc is anti-core antibody). 

^5 The results »n Table 17 demonstrate that the HCV antibody test is more accurate m detecting infected 
blood donors than are the surrogate marker tests. Nine out of ten patients who developed NANBH 
symptoms tested positive for anti-HCV antibody .r'^rocor. version. Of the n suspected donors, (pauent 6 
received donations from two different individuals suspected of being NANBH earners). 9 were positive for 
anti-HCV antibodies, and i was borderline positive -..nd therefore equivocal (donor for pauent ^). In contrast. 

20 using the elevated ALT test 6 of the ten donors tested negative, and usmg the anncore-aniibody test 5 of 
the ten donors tested negative. Of greater consequence, though, in three cases (donors to patients 8. 9. ano 
10) the ALT test and the ANTI-HBc test yielded inconsistent results. 



Table 17 
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DEVELOPMENT OF ANTI-HCV ANTIBODIES IN PATIENTS RECEIVING BLOOD 






FROM DONORS SUSPECTED OF BEING NANBH CARRIERS 






Patient 


Anti-HCV 


I No. Of 


Anti-HCV 


Surrogate 


30 




Seroconversion m 


L^o lations Donors 


Positive Donor 


Abnormality 






Patient 




















Alt 


Anti-HB 




1 


yes 


18 


equiv 


no 


no 


35 


2 


yes 


18 


yes 


NT 


yes 




3 


yes 


13 


yes 


no 


no 




4 


no 


18 


no 








5 


yes 


16 


yes 


yes 


yes 


40 


6 


yes 


11 


yes(2) 


no 


no 












yes 


yes 




7 


yes 


15 


yes 


NT 


no 




8 


yes 


20 


yes 


no 


yes 




9 


yes 


5 


yes 


yes 


no 


^5 


to 


yes 


15 


yes 


no 


ye^ 



"Same donor as anti-NANBV Positive. 
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'^•M- Amplification for Cloning of HCV cDNA Sequences Utilizing the PCR and Primers Derived from 
Conserved Regions of Flavivirus Genomic Sequences ~ 

The results presented supra., which suggest that HCV is a flavivirus or flavi-like virus, allows a strategy 
55 for cloning uncharacterized HCV cDNA sequences utilizing the PCR technique, and primers derived from 
the regions encoding.-cQnserved amino acid sequences in flaviviruses. Generally, one of the primers is 
derived from a defined HCV genomic sequence, and the other primer which flanks a region of unsequenced 
HCV polynucleotide is derived from a conserved region of the flavivirus genome. The flavivirus genomes 
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are known lo conjam ccr^served seouences vy.in.n me NSi. jna E pdypepudes. wn.ch are oncodeo m the 
5 -region ot ihe (laviv.rus genome. Corrc: oondmg sequences encoding tnese regions Uh upsiream of ihc 
HCV cDNA sequence shown in F,g. 26. Thus, to .soiaie cDNA sequences derived from ih.s region of the 
HCV genome, upstream pr.mers are designed which are derived trom the conserved sequences wimm 
these flavivifus poiyp^^ptides The downstream prirriefs are derived from an ups'ream end of the known 
portion of the HCV cDNA. 

Because of the degeneracy of the cede, it is probable that there will be mismatches between the 
flavivirus pro-bes and the corresponding HCV genomic sequence. Therefore a strategy which is similar to 
the onB described by Lee (1988) is used. The Lee procedure utilizes mixed oligonudeoiide primers 
complementary to the reverse translation products of an amino acid sequence: the sequences in the mixed 
primers lakes into account every codpn degeneracy for the conserved amino acid sequence- 
Three sets of primer mixes are generated, based on the amino acid homologies found in several 
flaviviruse?. including Oengue-2.4 (D-2.4). Japanese Encephalitis Virus (JEV). Yellow Fever (YF). and West 
Nile Virus iV^N). The primer mixture derived from the most upstream conserved sequence (S'-l). is based 
J 5 upon the amino acid sequence glytrp-gly. which is part of the conserved sequence asp-arg.giy.trp-gly- 
aspN found m the E protem of 0-2. JEV. YF. and V^'\'. The next primer mixture (5*-2) is based upon a 
downstream conseived sequence in E protein, phe-asp-gtyasp-ser-tyr-ileu-phe-gly-asp-ser-iyr-iieu. and is 
derived from phe-giy-asp; the conserved sequence is present in D-2, JEV. YF. and WN. The third pnmer 
mixture (5 3). is based on the amino acid sequence arg-ser-cys. which ts pan of the conserved sequence 
cys-cys-arg-ser-cys m the NSi protem of 0-2. 0-4. JEV. YF. and WN. The individual primers which form 
the mixture in 5 -3 are shown m Fig. 45. In addition to the varied sequences derived from conserved region, 
each primer in each mixture also contains a constant region at the 5 -end which contains a sequence 
encoding sues for restriction enzymes. Hindlll. fvlbol. and EcoRI. 

The dowr.stream primer. ssc5h20A. is denved from a nucleotide sequence in clone 5h. which ccniam: 
HCV cDNA with sequences with overlap those m clones i4i and nb. The sequence of ssc5h20A is 

5* GTA ATA TGG TGA CAG AGT CA 3*. 

An alternative pnmer. si>c5h34A. may ji;o be used. This pnmer is derived from a sequence m clone 5h. 
30 and in addition contains nucleotides at the s'-end which create a restriction enzyme site, thus facilitating 
cloning. The sequence of ssc5h34A is* 

5' GAT CTC TAG AGA AAT CAA TAT GGT GAC AGA GTC A 3*. 

OS The PGR reaction, which was initially described by Saiki ei al. (1986). is carried out essentially as 
described m Lee et al. (1988). except that the template for the cDNA is RNA isolated from HCV infected 
chimpanzee liver, as described m Section IV*C.2.. or from viral panicles isolated from HCV infected 
chimpanzee serum, as described in Section IV.A.l. In addition, the annealing conditions are less stnngent in 
the first round of amplification (0.6M NaCI. and 25 'C). since the pan of the primer which will anneal to the 

40 HCV sequence is only 9 nucleotides, and there could be mismatches. Moreover, if ssc5h34A is used, the 
additional sequences not derived from the HCV genome tend to destabilize the pnmer-template hybrid. 
After the first round of amplification, the annealing conditions can be more stringent (O.OSSf^ NaCI. and 
32 C-37 C). since the amplified sequences now contain regions which are complementary to. or duplicates 
of the primers. In addition, the first 10 cycles of amplification are run with Klenow enzyme I. under 

45 appropriate PCR conditions for that enzyme. After the completion of these cycles, the samples are 
extracted, and run with Tag polymerase, according to kit directions, as furnished by Cetus/Perkin-Eimer. 

After the amplification, the amplified HCV cDNA sequences are detected by hybridization using a probe 
denved from clone 5h. This probe is derived from sequences upstream of those used to derive the primer, 
and does not overlap the sequences of the clone 5h derived primers. The sequence of the probe is 
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5 CCC AGC GGC GTA CGC GCT GGA CAC GGA GGT GGC CGC GTC GTG TGG CGG TGT TQT TCT 
CGT CGG GTT GAT GGC GC 3'. 



'V N i. Creation of HCV cONA Library from liver of a Chimpanzee with infectious NANBH 

An HCV cONA library was created from liver from the chimpanzee from which the HCV cDNA library .n 
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Secfon IV.A 1 was created Tae lechn.Que iot oeanng the library was s.m.iar lo thai .n Section iva24 
except for mis u'.fiere'nt source of the Rna. and that a pnr.^er based on the sequence of HCV cDNa ,n clone 
1 lb was used. The sequence o( the pnmer wa^ 



5 5 CTG GCT TGA AGA ATC 3* 



10 
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'^Q^^'Q" !22 ""^'^Q^'^Q sequence of cerlapping HCV cDNA m clone k90 to cDNA in ctone np 



. Clone k9M was isolated from the HCV cDNA library created from the l.ver of an NANBH infected 
chimpanzee, as described m Section IV.A 25. The library was screened tor clones which overlap the 
sequence m clone nb. by using a clone which overlaps clone nb at the s'-terminus. clone Me The 
sequence of clone nb is shown in Fig. 23. Positive clones were isolated with a frequency of l in 500 000 
^5 One isolated clone. k9-l. was subjected to further study. The overlapping nature of the HCV cDNA m clone 
k9-i. to the 5 •end of the HCV-cDNA sequence m Fig. 26 was confirmed by probmg the clone with done 
Alex 45; this latter c::-c contains an HCV cDNA sequence of 30 base pairs which corresponds to those 
Dase pairs at the 5 terminus of the HCV cONA in clone I4i. described supra.. 

The nucleotide sequence of the HCV cONA isolated from clone k9-i was determined using the 
20 techniques described supra. The sequence of tho HCV cDNA in clone k9-i. the overlap with the HCV cDNA 
in Fig. 26. and the amino acids encoded therein are shown in Fig. 46. 

The HCV cDNA sequence in clone k9-i has been aligned with those of the clones described m Section 
IV.A. 19. to create a composite HCV cDNA sequence, with the k9-i sequence being placed upstream of the 
sequence shown .n Fig. 32. The composite HCV cDNA which includes t.he k9-i sequence and the am.no 
35 acrds encoded therein is shown in Fig. 47. 

The sequence of the ammo acids encoded in the s'-region of HCV cDNA shown m Fig. 47 has been 
compared with the corresponding region of one of the strains of Dengue virus, des.cribed supra., wnh 
respect to the profile of regions of hydrophobicity and hydrophilicity. This comparison showed that the 
polypeptides from HCV and Dengue encod'c* in this region, which corresponds to the region encoding NSi 
30 (or a portion thereof), have a similar hydrophobiChydrophiiic profile. 

The information provided infra. a.Mows the identification of HCV strains. The isolation and characteriza- 
tion of other HCV strains may be accomplished by isolating the nucleic acids from body components which 
contain viral panicles, creating cONA libraries using polynucleotide probes based on the HCV cDNA probes 
described m'ra.. screening the libraries for clones containing HCV cDNA sequences described infra., and 
comparing the HCV cDNAs from the new isolates with the cDNAs described infra. The polypeptides 
encoded therein, or in the viral genome, may be monitored for immunological cross-reactivity utilising the 
polypeptides and antibodies described supra. Strains which fit within the parameters of HCV, as described 
in the Definitions section, supra., are readily identifiable. Other methods for identifying HCv strains will be 
obvious to those of skill in the art. based upon the information provided herein. 
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Industrial Applicability 



The invention, m the various manifestations disclosed herein, has many industrial uses, some of which 
are the following. The HCV cONAs may be used for the design of probes for the detection of HCV nucleic 
acids in samples. The probes derived from the cDNAs may be used to detect HCV nucleic acids in. for 
example, chemical synthetic reactions. They may also be used m screening programs for anti-/iral agents, 
to determine the effect of the agents in inhibiting viral replication in cell culture systems, and animal model 
systems. The HCV polynucleotide probes are also useful in detecting viral nucleic acids in humans, and 
thus, may serve as a basis for diagnosis of HCV infections in humans. 

In addition to the above, the cDNAs provided herein provide information and a means for synthesizing 
polypeptides containing epitopes of HCV. These polypeptides are useful in detecting antibodies to HCV 
antigens. A series of immunoassays for HCV infection, based on recombinant polypeptides containing HCV 
epitopes are described herein, and will find commercial use in diagnosing HCV induced NANBH. in 
screening blood bank donors for HCV-caused infectious hepatitis, and also for detecting contaminated blood 
from infectious tjiood donors. The viral antigens will also have utility in monitoring the efficacy of anti-viral 
agents in ani.mal model systems. In addition, the polypeptides derived from the HCV cONAs disclosed 
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nofOifi vv.ii have utility as vaccmu? tor ireatrneni qi hCV mlocnofis 

The poiyoepiides der.ved from the HCV ONAs. bes.des mo abovo siaiM wses are ;,i:>o u^elui (o. 
.aising ani,.HCV ani.bod.es. Thus, ihey may oe useo .n ant.-HCV vaccnes Howevei. in^ ani.bo^Jiec 
produced as a result of immun.zai.on w.ih the HCV polypeptides are also use/ui .n deioct.ng me preserve 
5 of v.ral antigens .n samples. Thus, they may be used lo assay the production of HCV polypeptides in 
chemical systems. The ant.-HCV antibod.es may also be used to mon.tor the eff.cacy of ant..v.rai agents 
screening programs where these agents are tested m tissue culture systems. Th-y may also be used lor 
passive immunotherapy, and to diagnose HCV caused NANBH by allowing me detection of v.ral ant.gen(s) 
.n both blood donors and recipients. Another imponani use for ann-HCV antibodies .s .n alfm.ty chromaiog^ 
w raphy for- the purification of virus and viral polypeptides. The punfieo virus and viral polypeptide prepara- 
tions may bo used m vaccnes. However, the purified virus may also be useful for the development ol cell 
culture systems in which HCV replicates. 

Cell culture systems containing HCV infected cells will have many uses. They can be used lor me 
relatively targe scale production of HCV. which is normally a low titer v.rus. These systems will also be 
n useful for an elucidation of the molecular biology of the virus, ana lead to the development of anti-v.ral 
agents. The cell culture sysli-ms will also be useful m s-eemng for the eff.cacy of antiviral agents, in 
addition. HCV permissive cell culture systems are useful for the production of attenuated strains of HCV. 

For convenience, the anti-HCV antibodies and HCV polypeptides, whether natural or recombinant, may 
be packaged into kits. 

'JO The method used for isolating HCV cDNA. wh.ch .s comprised of preapnng a cDNA library derived from 
infected tissue of an individual, in an expression vector, and selecting clones which produce the expression 
products which react immunologically with antibodies m antibody-containing body components from other 
infected individuals and not from non-infected individuals, may also be applicable to the isolation of cDNAs 
derived from other heretofore uncharactenzed d.sease-assocaied agents whicn are compr-sed of a genomt.: 
component. Th.s. m turn, could lead to isolation and characterization of these agents, and to diagnostic 
reagents and vaccines for these other disease-associated agents 
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Claims 

1. A punfied HCV polynucleotide. 

2. A recombinant HCV polynucleotide. 

3. A recombinant polynucleotide compnsing a sequence derived frcm an HCV genome or from HCV 
cDNA. 

35 4. A recombinant polynucleotide encoding an epitope of HCV. 

5. A recombinant vector containing the polynucleotide of claim 2. or claim 3. or ciaim 4. 

6. A host cell transformed with the vector of claim 5. 

7. A recombinant expression system comprising an open reading frame (ORF) of DNA derived from an 
HCV genome or from HCV cONA. wherein the ORF is operably linked to a control sequence compatible 

40 with a desired host. 

8. A cell transformed with the recombinant expression system of claim 7. 

9. A polypeptide produced by the cell of claim 8. 

10. Purified HCV. 

11: A preparation of polypeptides from the HCV of claim 10. 
45 12. A purified HCV polypeptide. 

13. A purified polypeptide comprising an epitope which is immunologically identifiable with an epitope 
contained in HCV. 

14. A recombinant HCV polypeptide. 

15. A recombinant polypeptide comprised of a sequence derived from an HCV genome •-''.from HCV 
so cDNA. 

16. A recombinant polypeptide comprised o' an HCV epitope. 

17. A fusion polypeptide comprised of an HCV polypeptide. 

18. A monoclonal antibody directed against an HCV epitope. 

19. A punfied preoaration of polyclonal antibodies directed against HCV. 

55 20- A particle which is immunogenic against HCV infection comprising a non-HCV polypeptide having 
an amino acid sequence capable of forming a particle when said sequence is produced m a eukaryotic 
hos;. and r.o HCV epitope. 

21. A polynucleotide probe for HCV. 
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22 A k.i tor ana., ...10 samples lo. mo p,esenco ol Polynucieot.oes aor.vuO f^om hCV rornnr.c.nn 3 
poivnucteoi.de probe co.na.n.ng a nucieot.de segueuca Irom WCV ol aooui 8 or more Mudoonaf./ .n . 
suiiahie coniamer 

23 A k.t (Of analy^.rtg Ma.ples lo. iHe presence of an HCV ani.gen compr.s.ng an ani.booy o.fected 
against the HCV ant.gen to De detected, n a suiiaDle container. 

24. A kit lor analysing samples lor me presence ol an antibod.es d.iected aga.nst an hCv ant.oen 
comprising a polypcpi.oe containing an HCV epitope present .n the HCV antigen, m a suitable conta.ner 

25. A polypeptide comprised of an HCV epitope, attached to a solid substrate. 

26. ^n antibody to an HCV epitope, af.dched to a solid substrate. 

27 A method lor producmg a polypeptide containing an HCV epitope comprising .ncubat.ng nost cells 
transformed w.m an expression vector conta.n.ng a sequence encoding a polypept.de coma.n.ng an HCV 
epitope under conditions which allow exprer.sion ol said polypept.de. 

23. A polypeptide containing an HCV epitope produced by the motl.od of claim 27 

29. A niethod lor detecting HCV nucleic acids in a sample comprising: 

(a) reacting nucle.c acids of me sample wuh a prooe lor an HCV polynucleotide under conditions 
which allow the (ornr^ation of a polynuc.uot.de duplex between the probe and me HCV nucleic acd from me 
sample; and 

(D) detecting a polynucleotide duplex which contains the probe. 

30. An immunoassay lor detecting an HCV antigen comprising: 

(a) incubating a sample suspected of containing an HCV antigen with a probe antibody directed 
against the HCV antigen to be detected under conditions wh.ch allow the formation of an ant.gen-annboov 
complex: and 

(b) detecting an anngen-antibody complex containing the probe antioody. 

31. An immunoassay for delecting antibodies directed aga.nst an HCV anngen comprising: 

(a) incubating a sample suspected of containing anti-HCV antibodies with a probe p-»iypept.de whicn 
contains an epitope of the HCV. under conditions wh.ch allow the formation of an antib'-oy-antigen comoiex- 
and 

(b) detecting the antibody-antigen complex containing the probe antigen. 

32. A vaccine for treatment of HCV infection comprising an immunogenic polypeptide containing an 
HCV epitope wherein the immunogenic polypeptide is present .n a pharmacologically effective dose in a 
pharmaceuiicaliy acceptable excipient. 

33. A vaccine for treatment of HCV infection comprising inactivated HCV .n a pharmacologically 
effective dose m a pharmaceuticaliy acceptable excipient. 

34. A vaccine for treatment of HCV infection comprising attenuated HCV in a pharmacologically 
effective dose in a pharmaceuticaliy acceptable excipient. 

35. A tissue culture grown cell infected with HCV. 

36. The HCV infected cell of claim 35. wherein the cell is of a human macrophage cell line, or is of a 
hepatocyte cell line, or is of a mosquito cell line, or is of a lick cell line, or is of a mouse macrophage cell 
line, or is an embryonic cell. 

37. The HCV infected cell of claim 35. wherein me cell is of a coll line derived from liver of an HCV 
infected individual. 

38- A method for producing antibodies lo HCV comprising administering to an individual an isolated 
immunogenic polypeptide containing an HCV epitope in an amount sufficient to produce an immune 
response. 

39, A method for producing antibodies to HCV comprising administering to an individual the polypeptic^e 
preparation of claim n. wherein the preparation contains at least 1 immunogenic polypeptide, and the 
administering is of an amount sufficient to produce an immune response. 

40. A method for isolating cDNA derived from the genome of an unidentified infectious agent, 
comprising: 

(a) providing host cells transformed with expression vectors containing a cDNA library prepared from 
nucleic acids isolated from tissue infected with the agent and growing said host cells under conditions 
which allow expression of polypeptide(s) encoded in the cONA; 

(b) interacting me expression products of the cDNA with an antibody containing body component of 
an individual infected with said infectious agent under conditions which allow an immunoreaction. and 
detecting antibody-antigen complexes formed as a result of the interacting: 
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(C) growing nost cells wh.cn eip/ess polypeptides thai .'orm ani.tody-ani.gen complexes step (pj 
under conoiitons which allow their growin as muividual clones and isolating said clones: 

(d) growing cells from ihe Clones ol (c) under conditions which allow expression ct poiypepnaeis) 
encoded withm the cDNA. and mieracimg the expression products with antibody containing pody compo- 
nents o( H»div.duals oinei than the individual in step (a) who are infected with the infectious agent and w.ih 
control individuals uninfected w.ih the agent, and detecting antiDodyantigen complexes formed as a result 
of the interacting; 

(e) growing host cells which expresr polypeptides that form annbody-antigen complexes with 
antibody, containing body components of infected individuals and individuals suspected of being infected, 
and not with said components ol control individuals, under conditions which allow their growth as individual 
clones and isolating said clones: and 

(f) isolating the cDNA from the host cell clones of {e). 
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* FIG. I Translation of DNA 5-i-l 

AlaSerCysLeuAsnCysSerAlaSerllelleProAspArgGiuVaiLeuTyrArgGlu 
1 GGCCTCCTGCTTGAACTGCTCGGCGAGCATCATACCTGACAGGGAAGTCCTCTACCGAGA 
CCGGAGGACGAACTTGACGAGCCGCTCGTAGTATGGAC rGTCCCTTCAGGAGATGGCTCT 

PheAsoGluMetGluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeu 
61 GTTC^ATGAGATGGAAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCT 
CAAGC^ACTCTACCTTCTCACGAGAGTCJTGAATGGCATGTAGCTCGTTCCCTACTACGA 

AlaGluGlnPheLysGlnLysAlaLeuGlyLeu 
121 CGCCGAGCAGTTCAAGCAGAAGGCCCTCGGCCTCC 
» GCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 



FIG. 3 Transaction of DNA 5-1-1, 81, 91&1-2 

GlyCysValValileValGlyArgValValLeuSerGlyLysProAlallelleProAsp 

1 ctggctgcgtggtcat;.c;jggcagggtcgtcttgtccgggaagccggcaatcatacctg 
gaccgacgcaccagtatcacccgtcccagcagaacaggcccttcggccgttagtatggac 

T 

ArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeuProTyr 
6 1 ACAGGG AAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTACCGT 
TGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCA 

! A 

IleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeuGln 
121 ACATCG AGCAAGGG ATGATGCTCGCCG AGC AGTTCAAGC AG AAGGCCCTCGGCCTCCTGC 
TGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACG 

ThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLysLeu 
131 AGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAAC 
TCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTG 

V 

GluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAlaGly 
241 TCGAGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGTGGGATACAATACTTGGCGG 
AGCTCTGGAAGACCCGCTTCGTATACACCTTGAAGTAGTCACCCTATGTTATGAACCGCC 

LeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAlaVal 
301 GCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTG 
CGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGACGAC 

ThrSerProLeuThrThrSerGln 
361 TCACCAGCCCACTAACCACTAGCCA/iJV 
i \ AGTGGTCGGGTGATTGGTGATCGGTTT 



FIG. 4 Translation ot* DNA 81 

SerGlyLvsProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMet 
1 GTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGAT 
CAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTA 

GluGluCysSerSlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPhe 
61 GGAAGAGTGCTCTCAGCACTTACCGTACATCCAGCAAGGGATGATGCTCGCCGAGCAGTT 
CCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAA 

LysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaPro 
121 CAAGCAG AAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAG AGGTTATCGCCCC 
GTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGG 

AlaValGlnThrAsnTrpGlnLysLeuG] uThrPheTrpAlaLysHisMetTrpAsnPhe 
131 TGCTGTCCAGACCAACTGGCAAAAACTCGAGACCTTCTGG JCGAAGCATATGTGGAACTT 
ACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAA 

IleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAla 
241 CATCAGTGGG ATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGC 
GTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACG 

SerLeuMetAlaPheThrAlaAlaValXhrSerProLeuThrThrSerGln 
301 TTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAA 
AAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTT 



FIG. 5 Translation of DNA 36 

AspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAla 
1 GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrp 
61 TACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG 
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACC 

LysCysLeuIleArgLeuLysProThrLeuHisGlyProThxProLeuLeuTyrArgLeu 
121 AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCys 
131 GGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATCACATGC 
CCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACG 

MetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGlyValLeuAla 
241 ATGTCGGCCG ACCTGG AGGTCGTCACG AGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGA 



AlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeu 
301 GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG 
CGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

Overlap with 81 

SerGlyLysProAlallelleProAspArgGluValLeuTyrArg 
361 TCCGGGAAGCCGGCAATCATACCTC ACAGGG AAGTCCTCTACCG AG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTC 



FIG^6 Combined ORF of JNAs 36 & 81 



AspAlaHisPheLeuSerGlnThrLysGlnSerGlyGIuAsnLeuProTyrl/euValAla 
1 GATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCG 
CTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGC 

TyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGltiMetTrp 
6 1 TACCAAGCCACCGTGTGCGCTAGGGCTCAAGrCCC-CCCCCATCGTGGGACCAGATGTGG 
ATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACC 

LysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeu 
121 AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTG 
TTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGAC 

GlyAiaValGltiAsnGluIleThrLauThrHisProValThrLysTyrlleMetThrCys 
181 GGCGCTGTTCAG AATG AAATCACC'JTGACGCACCCAGTCACCAAATACATCATG ACATGC 
CCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACG 

MetSerAlaAspLeuGluValVa^ThrSerThrTrpValLeuValGlyGlyValLeuAla 
24 1 ATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCGTGGCT 
TACAGCCGGCTGGACCTCCAGCAGTv;CTCGTGGACCCACGAGCAACCGCCGCAGGACCGA 

AlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeu 
301 GCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTG 
CGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAAC 

SerGlyLysProAlallelleProAspArgGLuValLeuTyrArgGluPheAspGluMet 
361 TCCGGGAAGCCGGCAATCATACCTG ACAGGGAACTCCTCT.VCCGAGAGTTCGATGAGATG 
AGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTAC 

GluGluCysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPhe 
421 G AAGAGTGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTC 
CTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAG 

LysGLnLysAlaLeuGlyLeuLeuGlnThrAlaSerArgGLnAlaGluVallleAlaPro 
4 81 AAGCAGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCT 
TTCGTCTTCCXWGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGA 

AlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhe 
541 GCTGTCCAGACCAACTGGCAAAAACTCGAG ACCTTCTGGGCGAAGC ATATGTGG AACTTC 
CGA<»GGTCTCGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCTTGAAG 

IleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAla 
601 ATXIJWSTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCT 
TAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGA 

SerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerCln 
661 TCATTG ATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAA 
AGTAACTACCGAAAATGTCGACGAavGTGGTOX^TGATTGGTGATCGGTTT 
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PI (3 7 Translation of DNA 32 



Overlap with 81 

PheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeu 
1 CTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATAT 
GAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATA 

GlyGlyTrpValAlaAlaGln]>-.uAlaAlaProGlyAlaAlaThrAlaPheValGlyAla 
61 TGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCG 
ACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGC 

GlyLeuAlaGlyAlaAlail.-V.ySerValGlyLeuGlyLysValLeurieAspIleLeu 
121 CTGGCTTAGCTGGCGCCGCC.^TCGGCAGTGTTGGACTGGGGAAGGTCCTCATAG ACATCC 
GACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGG 

AlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGlu 
181 TTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAG ATCATGAGCGGTG 
AACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCAC 

ValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeu 
241 AGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCC 
TCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGG 

valValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGluGlyAla 
301 TCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCG AGGGGG 
AGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCC 

ValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
361 CAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
GTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 
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FIG. 8 Translation of DNA 35 

SerlleGlur^rlleThrLeuProGlnAbpAlaValSerArgThrGlnArgArgGlyArg 
1 TCCATTGAGACAATCACGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGG 
AGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCC 

ThrGlyArgGlyLysProGlyIleTyrArgPheVaLAlaProGlyGluArgPrc^-»rGl7 
61 ACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGC 
TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCCG 

MetPheAspSerSerValLeuCysCluCysTyrAspAlaGlyCysAlaTrpTyrGluLeu 
121 ATGTTCGACTCGTCCGTCCTCTGTk'AGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTC 
TACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
131 ACGCCCGCCGAG ACTACAGTTA'- ':':TACG AGCGTACATGAACACCCCGGGGCTTCCCGTG 
TGCG3GCGGCTCTGATGTCA.aT»:CGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCAC 



CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAla 
241 TGCCAGG ACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAG ATGCC 
ACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 



HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuPrcTyrLeuValAlaTyrGln 
301 CACTTTCTATCCCAGACAAAGCAGAGTGGGGAG AACCTTCCTTACCTGGTAGCGTACCAA 
GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTT 

Overlap with 36 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 
361 GCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG AAGTGT 
CGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 



LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla 
421 TTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCT 
AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 



PIG. 9""! Combined ORF of DNAs 35. 36. 81 & 32 
SerlleGLuThrlleThrLeuProGInAspAlaVaLSerArgThrClaALrgAxqGLyArg 

1 tccattgagacaatcacgctcccccaggatgctgtctcccgcactcaacgtcggggcagg 
aggtaactctgttagtgcgaggggc:tcctacgacagagggcgtgagttgcagccccgtcc 

ThrGLyArgGlyLysProGlylleTyrArgPheValAlaProGlyGluArgProSerGly 
6 1 ACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCGGC 
TGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCG7GGCCCCCTCGCGGGGAGGCCG 

MetPheAspSerSerValLeuCysGluCysTyrAspAlaGLyCysAlaTrpTyrGluLeu 
121 ATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTC 
TACAAGCTGAGCAGGCAGGAGACACTCACGATATGCGTCCGACACGAACCATACTCGAG 

ThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
181 ACGCCCGCCGAGACTACAGTTAGGCTACG/ "-GTACATGAACACCCCGGGGCTTCCCGTG 
TGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCAC 

CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThrHisIleAspAia 
241 TGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATGCC 
ACGGTCCTGGTAGAACTTAAAaCCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTACGG 

HisPheLeuSerGlnXhrLysGlnSarCiyGluAsnLeuProTyrLeuValAlaTynJln- 
301 CACTTTCTATCCCAG ACAA/.GCAG AGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAA: 
GTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTT 

AlaThrValCysAlaArgAl aGl '-f laProProProSerTrpAspGlnMetTrpLysCys 

3 6 1 GCCACCGTGTGCGCTAGGGr.TCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGTGT 

CGGTGGCACACGCGATCCCoAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACA 

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla 

4 21 TTG ATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCT 

AACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGA 

ValGlnAsnGluIleThrL€uThrHisProValThrLysTyrIleMetThrCysy.etSer 
4 81 GTTCAGAATGAAATCACCCTGACGC.ACCCAGTCACCAAATACATCATGACATGCATGTCG 
CAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGC 

AlaAspLeuG luVa 1 Va iThrSerThrTrpValLeuVa IG lyGlyValLeuAlaAlaLeu 
341 GCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTG 
CGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAAC 

AlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeuSerGly 
601 GCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGG 
CGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCC 

LysProAlallelleProAspArgGluValLeuTyiArgGluPheAspGluMetGluGlu 
661 AAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAG 
TTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTC 

CysSerGlnHisLeuProTyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGln 
7 21 tGCTCTCAGCACTTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAG 
ACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTC 

LysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaVal 
781 AAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTC 
T7CCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAG 



GlnThxAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSer 
841 CAGACCAACTGGCAAAAACTCG AGACCTTCTGGGCGAAGCATATGTGGAACTTCATCAGT 
GTCTGGTTGACCGTTTTTGAGClCTGGAAGACCrCCTTCGTATACACCTTGAAGTAGTCA 

GlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeu 
901 GGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTG 
CCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAAC 

MetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsn 
961 ATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAAC 
TACCGAAAATGTCGACGACAGrGGTCGGGTGATTGGTGATCGGTTTGGGAGGACAAGTTG 

IleLeuGlyGlyTrpValAlaAlaClnleuAlaAlaProGlyAlaAlaThrAlaPheVal' 
1021 ATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTG: 
TATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACAC" 

GlyAlaGlyLeuAlaGlyAlr-A.' .^leGlySerValGlyLeuGlyLysValLeuIleAsp • 
1081 GGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGAC 
CCGCGACCGAATCGACCGCCJGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTG 

IleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSer 
1141 ATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGC 
TAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCG 

GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGly 
1201 GGTG AGGTCCCCTCCACGG AGG ACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGA 
CCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCT 

AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyProGlyGlu 
1261 GCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCGAG 
CGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTC 

GlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
1321 GGGGCAGTGCAGTGGATGAACCGGCTG ATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCC 
CCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGG 

FIG. 9-2 
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f\Q, iO Translation of DNA 37b 



LeuAlaAlaLysLeuValAlaLeuGl IleAsnAlaValAlaTyrTyrArgGlyLeuAsp 
1 CTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGAC 
GAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTG 



ValSerVallleProThrSerGlyAspValValValValAlaThrAspAlaLeuMetThr 
61 GTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCGATGCCCTCATGACC 
CACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGG 



GlyTyrThrGlyAspPheAspSerVallleAspTyrAsnThrCysValThrGlnThrVal 
121 GGCTATACCGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTC 
CCGATATGGCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAG 

Overlap with 

AspPheSerLeuAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaVal 
181 G ATTTCAGCCTTG ACCCTACCTTCACCATTGAGACAATCACCCTCCCCCAGGATGCTGTC 
CTAAAGTCGGAACTGOGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAG 

clone 35 

SerArgThrGlnArcArgGlyArgThr 
24 1 TCCCGCACTCAACGTCGGGGCAGGACTG 
AGGGCGTGAGTTGCAGCCCCGTCC" 



FIG. 11 Translation of DNA 33b 



Overlap with 32 

MetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValScrrroThrHisTyrVal 
1 GATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGT 
CTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCA 

ProGluSerAspAlaAlaAlaArgValThrAlalleLeuSerSerLeuThrValThrGln 
61 GCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCAGCCTCACTGTAACCCA 
CGGCCTCTCGCTACGTCGACGGGCGCVGTGACGGTATGAGTCGTCGGAGTGACATTGGGT 

LeuLeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThrProCysSerGlySer 
121 GCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGTGTACCACTCCATGCTCCGGTTC 
CGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAG 

TrpLeuArgAspIleTrpAspTrprieCysGluValLeuSerAsDPheLysThrTrpLeu 
181 CTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCT 
GACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGA 

LysAlaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGlyTyr 
241 AAAAGCTAAGCTCATGCCACAGCTGCCTGGG ATCCCCTTTGTGTCCTGCCAGCGCGGGTA 
TTTTCGATTCGAGTACGG.TGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCAT 

LysGlyValTrpArgVal 
301 TAAGGGGGTCTGGCG AGTG 
ATTCCCCCAGACCGCTCAC 
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FIG. 12 Translation of DMA 40b 

AlaTyrMetSerLysAlaHisGlylleAspProAsnlleArgThrGlyValArgThrlle 
1 GGCTTACATGTrCAAGGCTCATGGGATCGATCCTA*»CATCAGGACCGGGGTGAGAACAAT 
CCGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGiySerProIleThrTyrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
61 TACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC • 

SerGlyGlyAlaTvrAspIlellelleCysAspGluCysHisSerThrAspAlaThrSer 
121 CTCGGGGGGCGCTTATGACATAATAATTTGTGACGAGTGCCACTCCACGGATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAG 

IleLeuGlylleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal 
1 S 1 CATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGT 
GTAGAACCCGTAGCCGTGACAGGAArTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACA 

LeuAlaThrAlaThrProProGlySerValThrValProHlsProAsnlleGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalleProLeuGluVallle 
301 TGCTCTGTCCACCACCGG AG AG ATCCCTTTTTACGGCAAGGCTATCCCCCTCG AAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTA 



LysGlyGlyArgHisLeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAla 
361 CAAGGGGGGGAG AC ATCTCATCTTCTGTCATTCAAAGAAG AAGTGCGACGAACTCGCCGC 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCG 

Overlap with 37b 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
4 21 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 



481 



IleProThr 
CATCCCGACCAG 
GTAGGGCTGGTC 



FIG. 13 Translation of DNA 25c 



CysSerLeuThrValThrGlnleuLeuArgArgLeuHisGlnTrpIleSerSerGluCys 
1 ACTGCAGCCT'^ACTGTAACCCAGCTCCTGAGGCGACTGCACCAGTGGATAAGCTCGGAGT 
TGACGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCA 



ThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeu 
61 GTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGT 
CATGGTGAGGTACGAGGCv'ZAAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACA 

Overlap with 33b 

SerAspPheLysThrTrpLeuLy.«:Al^LysLe'aMetProGlnLeuProGlyIleProPhe 
1 :: 1 TG AGCGACTTTAAG ACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGG ATCCCCT 
ACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCGTAGGGGA ■ 



ValSerCysGlnArgGlyTyrLysGlyValTrpArgGlyAspGlylleMetHisThrArg 

131 ttgtgtcctgccagcgcgggtataagggggtctggcgaggggacggcatcatgcacactc 
a;^cacaggacggtcgcgcccatattcccccagaccgctcccctgccgtagtacgtgtgag 

CysHisCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArglleValGly 
241 GCTGCCACTGTGGAGCTG AG ATCACTGGACATGTCAAAA.^CGGGACGATGAGG ATCGTCG 
CGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGC 

ProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrGly 
301 GTCCTAGG ACCTGC AGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGG 
CAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCC 

ProCysThrProLeuPrbAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGlu 
361 GCCCCTGTACCCCCCTTCCTGCGCCG AACT ACACGTTCGCGCTATGGAGGGTGTCTGCAG 
CGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTC 

GluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGlyMetThrThrAsp 
4 21 AGG AATATGTGG AG ATAAGGCAGGTGGGGGACTTCCACTACGTG ACGGGTATGACTACTG 
TCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGAC 

AsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
4 81 ACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAAT 
TGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 



FIG. 14-lccmbined ORF of DNAs 40b/37b/35/36/81/32/33b/25c 

AlaTyrMetSerLysAlaHisGlylleAspProAsnlleArgThrGlyValArgThrlle 
1 TGCTTACATGTCCAAGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAAT 
ACGAATGTACAGGTTCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTA 

ThrThrGlySerProIleThrTvrSerThrTyrGlyLysPheLeuAlaAspGlyGlyCys 
6 1 - -TACCACTGGCAGCCCCATCACGT ACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGTG 
ATGGTGACCGTCGGGGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCAC 

SerGlyGlyAlaTyrAspIlellelLeCysAspGluCysHisSerThrAspAlaThrSer 
121 CTCGGGGGGCGCTTATG ACATAATAATTTGTGACGAGTGCCACTCC ACGG ATGCCACATC 
GAGCCCCCCGCGAATACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAG 

IleLeuGlylleGlyThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValVal 
181 C ATCTTGGGC ATCGGCACTGTCCTTG Arc AAG Z AGAG ACTGCGGGGGCG AG ACTGGTTGT 
GTAGAACLCGTAGCCGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACA 

LeuAlaThx*AlaThrProProGlySerValThrValProHisProAsnIleGluGluVal 
241 GCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCC AACATCG AGG AGGT 
CGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCA 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalleProLeuGluVallle 
301 TGCTCTGTCCACCACCGGAGAG ATCCCTTTTTACGGCAAGGCTATCCCCCTCG AAGTAAT 
ACGAGACAGGTGGTGGCCTCTCTAGGGA.\AAATGCCGTTCCGATAGGGGGAGCTTCATTA 

LysGlyGlyArgHisLeuIlePheCysHisSerLvsLysLysCysAspGluLeuAlaAla 
361 CAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGQ 
GTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTi^CACGCTGCTTGAGCGGCG^: 

LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
4 21 AAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGT 
TT7CGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCA 

IleProThrSerGlyAspValValValValAlaThrAspAlaLeir^.etThrGlyTyrThr 
4 81 CATCCCGACCAGCGGCGATGTTGTCGTCGTGGCAACCG ATGCCCTCATGACCGGCTATAC 
GTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATG 

GlyAspPheAspSerVallleAspTyrAsnThrCysValThrGlnThrValAspPheSer 
541 CGGCGACTTCGACTCGGTGATAGACTACAATACGTGTGTCACCCAGACAGTCGATTTCAG 
GCCGCTGAAGCTGAGCCACTATCTGATGTTATGCACACAGTGGGTCTGTCAGCTAAAGTC 

LeuAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThr 
601 CCTTGACCCTACCTTCACCATTGAG ACAATCACGCTCCCCCAGG ATGCTGTCTCCCGCAC 
GGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTG 

GlnArgArgGlyArgThrGlyArgGlyLysProGlylleTyrArgPheValAlaProGly 
661 TCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTGTGGCACCGGG 
AGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCC 

GluArgProSerGlyMetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCys 
721 GGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATGACGCAGGCTG 
CCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGAC 

AlatrpTyrGluLeuThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThr 
781 TGCTTGGTATGAGCTCACGCCCGCCG AG ACTACAGTTAGGCTACG AGCGTACATG AACAC 
ACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTG 

ProGlyLeuProValCysGlnAspHisLeuGluPheTrpGIuGlyValPheThrGlyLeu 
841 CCCGGGGCTTCCCGTGTGCCAGG ACCATCTTG AATTTTGGG AGGGCGTCTTTAC AGGCCT 
GGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGGA 

ThrHisIleAspAlaHisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeaProTyr 
901 CACTCATATAGATGCCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTA 
GTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAAT 
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LeuValAlaTyrGlnAlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAsp 
961 CCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGA 
GGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCT 

GlnMetTrpLysCysLeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeu 
1021 CCAGATGTGGAAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCT 
GG5CTACACCTTCACAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGA 

TyrArgLeuGlyAlaValGlnAsnGluIieThrLeuThrHisProValThrLysTyrlle 
1081 ATACAG ACTGGGCGCTGTTCAG AATGAAATCACCCTGACGCACCCAGTCACCAAATACAT 
TATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTA 

MetThrCysMetSerAlaAspLeuGluValValThrSerThrTrpValLeuValGlyGly 
1141 CATG AC ATGC ATGTCGGCCG ACCTG JAGGTCGTCACGAGCACCTGGGTGCTCGT1*GGCGG 
GTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCC 

ValLeuAii^AlaLeuAlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArg 
1 2C 1 CGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAG 
GCAGGACCGACCAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTC 

ValValLeuSerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPhe 
1261 GGTCGTCTTGTCCGGG AAGCCGGCAATCATACCTG ACAGGG AAGTCCTCTACCGAG AGTT 
CCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAA 

AspGluMetGluGluCysSerGlnHisLeuProTyrlleGIuGlnGlyMetMetLeuAla 
13 21 CGATGAGATGGAAGAGTGCTCTCAOCAtTTACCGTACATCGAGCAAGGGATGATGCTCGC 
GCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCG 

GluGlnPheLysGlnLysAlaLeuGlyLeuLeuGlnThrAlaSerAryGlnAlaGluVal 

13 31 CGAGCAGTrCAAGCAGAAGGC'IC^CGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGT 

GCTCGTCAAGTTCGTCTTCCGGGaGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCA 

IleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMet 

14 41 TATCGCCCCTGCTGTCCAGACCi^J^CTGGCAAAAACTCG AGACCTTCTGGGCGA/.GCATAT 

ATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTATA 

TrpAsnPhelleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
1501 GTGGAACTTCATCAGTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACCC 
CACCTTGAAGTAGTCACCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGG 

AlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
1561 CGCCATTGCTTCATTCATGGCTTTTACAGCTGCTGTCACCAGCCCACTAACCACTAGCCA 
GCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGGT 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAla 
1621 AACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGC 
TTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACG 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAloAlalleGlySerValGlyLeuGly 
1681 CGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGG ACTGGG 
GCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACCC 

LysValLeuIleAspIleLc-.:iAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuVaLAla 
1741 G AAGGTCCTCATAG ACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGG AGCTCTTGTGGC 
CTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCG 

PheLysIleMetSerGlyGluValProSerThrGluAspLeuValAsnLeuLeuProAla 
1801 ATTCAAGATCATGAGCGGTGAGGTCCCCTCCACGG AGG ACCTGGTC AATCTACTGCCCGC 
TAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCG 

IleLeuSerProGlyAlaLeuValValGlyValValCysAlaAlalleLeuArgArgHis 
1361 CATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCA 
GTAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGT 

FIG. 14 -Z ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArg 



£P 0 318 216 Al 



1921 CGTTGGCCCGGGCG AGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCG 
GCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGC 

GlyAsnHisValSerProThrHisTyrValProGluSerAspAlaAlaAlaArgValThr 
1981 GGGGAACCATGTTTCrCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCAC 
C^CCTTGGTACAAAGGGGGTGCGTGAxGCACGGCCTCTCGCTACGTCGACGGGCGCAGTG 

AlalleLeuSerSerLeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSer 
2041 TGCCAT ACTCAGCAGCCTCACTGTAACCCAGCTCCTG AGGCG ACTGCACCAGTGG ATAAG 
ACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTC 

SerGluCysThrThrProCysSerGlySerTrpLeuAroAspIleTrpAspTrpIleCys 
2101 CTCGGAGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATG 
GAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATAC 

GluValLeuSerAspPiieL/sThrTrpLeuLysAlaLysLeuMetProGlnLeuProGly 
2161 CG AGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGG 
GCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACC 

lleProPheValS ."ysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMet 
2221 GATCCCCTTTGTGTCCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCAT 
CTAGGGGAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTA 

HisThrArgCysHisCysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArg 
2281 GCACACTCGCTGCCACTGTGGAGCTGAGATCACTGG AC ATGTCAAAAACGGGACG ATGAG 
CGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTC 

IleValGlyProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyr 
2341 GATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTA 
CTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGAT 

ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgVal 
24 01 CACCACGGGCCCCTGTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGG AGGGT 
GTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCA 

SerAlaGluGluTyrValGluIleArgGlnValGlyAspPheHisTyrValThrGl-yMet 
24 61 GTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTAT 
CAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATA 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGlu 
2521 GACTACTGACAATCTCAAATGCCCGTGCCAGGTCCGATCGCCCGAA.TTTTTCACAGAAT 
CTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 



FIG. 14-3 



EP 0 318 216 A1 



FIG. 15 Translation of DNA 3 3c 

AlaValAspPhelleProValGluAsaLeuGluThrThrHetArgSerProValPheThr 
1 GGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCAC 
CCGCCACCTGAAATAGCGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTG 

AspAsnSerSerProProValValBroGlnSerPheGlnValAlaHlsL«uHisAlaPro 
6 1 GGATAACTCCTCTCCACavGTAGTGCCCacGAGCTTCCAGGTGGCTCACCTCCATGCTCC 
CCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGG 

ThrGlySerGlyLysScrthrLysValProAlaAlaTyrAlaAlaGlnGlTTTTLysVal 
121 CACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCrGCATATGCAG>:TCAGGGCTAIAAGGT 
GTGTCCGTCGCCG i"mt. GTGGTTCCAGGGCCGACGTATACGTCGA GTCCCGATATTCCA 

LeuValLeuAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTTTMetSerLysAla 
181 GCTAGTACrrjUUICCCTCTGTTGCTGCAACACTC JGCTTTGGTGCTTACATGTCCAAGGC 
CGATCJVTGAGri«GGGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCC^ 



-Overlap with 40b- 



HisGlylleAspPrtjASDileArgThrGlyValArgThrlleThrThrGlyScrProIle 
241 TCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCAT 
AGTACCCIAGCIJUXLVTTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGCW^ 



ThrTyrSerrhrTTiGlyLysPheLeuAlaAspGlyClyCysSerGlyGlyAlaTTrAsp.. 
301 CACGTACTCCACCTACGGOU^GTTCCr.viCCGACGGCGGGTGCTCGGGGGGCGCTTA'KA I 
GTGCATGAGGTGGATGCGGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACT 



IlellelleCysAspGluCy.'H : iSerThrAsDAlaThrSerlleLeuGly IleGlyThr 
361 <:ATAATAATTTGTGAC6AGTGCCACTCCACGGAix:CCACATCCATCTTGGGCATTGGCAC 
GTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAACCGTG 



ValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValValI.euAlaThrAlaThrPro 
421 TGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCACCCC 
ACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGG 



ProGlyS-arValThrValProHisProAsnlleGluGluValAlaLeuSerThiThrGly 
481 TCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGG AGGTTGCTCTGTCCACCACCGG 
AGGCCCGAGGCAGTGACAOKWGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCC 

GluIleProPheTyrGlyLysAlalleProLeuGluVallleLysGlyGlyArgHisLeu 
541 AGAGATCCCTTTTTACGGCAAGGCTAXCCCCCTCGAAGTAATCAAGGGGGGGAGACATCr 
TCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGA 



IlePheCysHlsSerLTsLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeuGly 
601 CATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGG 
GTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTrCGACCAGCGTAACCC 



IleAsnAlaVaLAlaTyrtyxArgGlyLeuAspValSerVallleProThrSerClyAsp 
661 CAXCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGA 
GTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCr 

ValValValValAlaThrAspAlaLeuKetXhrGlyTyrthxClyAspPheAspSerVal 
721 TGTTGTCGTCGTGGCAACCGATGCCCTCATG ACCGGCTATACCGGCG ACTTCG ACTCGGT 
ACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCA 

IleAssCysAsnThrCys 
7a I GATAGACTGC^ATACGTGTG 
CTArcrGACGTTATGCACAC 



FIG. 16 Translation of DMA 8h 

_ ProC73ThrC7sGl7SexS.*xAspLeuT7rLeuValThrArgHisAlaAspValIlePro 
I CrCCCrGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGOVCGCCGATG^ 
GAGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAfiTAAG 

ValArgAr^ArgGlyAspSerArgGlySexLeuLeuSerProArgProIleSerTTrLeu 
6 1 CCGTGCGCCGGCGGGGTGATACCAGGGGCAGCCTGCTGTCGCCCCGGCCCATTTCCTACT 
GGCACGCGGCCGCCCGkCTATCGTCCCCGTCGGACGACAGCGGCGCCGGGTAAAGGATGA 

L7sGl7SexSexGl7GlyProLeuLeuC7sPrcAlaGlyHisAlaValGl7llePheArg 
121 TGAAAGGN.TCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACGCCGTGGGCATATTTA 
ACTTTCCGAGGAflCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAAT 

Ovexlap with 

AlaAlaValC7sThrArgGl7ValAlaL7sAlaValAspPheIleProValGluAsnI/eu 
181 GGGCCGCGGTGTGCACCCGTGGAGTGGCTAACGCGGTGGACTTTATCCCTGTGGAGAACC 
CCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGG 

33c 

GluThrThrMetArgSerPicv^alPheThrAspAsnSer 
241 TAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTC 
ATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAG 



FIG. 17 Translation of DMA 7e 

Gl7TrpArgLexiLe\iAlaProlleThrAlaT7rAlaGlnGlnThrArgGl7LeuLeuGl7 
1 GGGGTGGAGGTTGCTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGG 
CCCCACCTCCAACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCC 

C7SIleIleThrSerLeuThrGl7ArgAspL7sAsnGlnValGluGlyGluValGlnIle 
6 1 GTGCATAATCACCAGCCT AACTGGCCGGG ACAAAAACCAAGTGGAGGGTGAGGTCCAG AT 
CACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTA 

ValSexThrAlaAlaGln!rhrPheLeuAlaThxC7SIleAsnGlyValC7sTrpThrVal 
121 TGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGT 
ACACJlGTTGACGACGGGTTTGGAAGGACCGTTGaUSSTAGTTACCCCACACGACCTGACA 

T7rHisGl7AlaGl7ThrArgThrIleAlaSexProL7sGl7ProValIleGlnMetT7r 
1 31 CTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATGTA 
GATGGTGCCCCGGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACAT 



ThrAsnValAspGlnAspLeuValGl7TrpProAlaProGliiGl7S«rArgSerLeuThr 
241 TACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGAC 
ATGGTTACATCTGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTG 

Overlap with 8h 

P roCy sThrCy sG ly SexSer AspLeuTyrLeuValThr ArgHi s 
301 ACCCTGCACTTGCGGCTCCTCGGACCTTTACCTGGTCACGAGGCACG 
TGGGACGTGAACGCCGAGGACCCTGGAAA33GGACCAGTGCTCCGTGC 



FIG. 18 Translation of DNA 14c 



AsnMetTrpSerGlyThrPheProIleAsnAlaTyrThrThrClyProCysThrProLeu 
1 GP ACATGTGGAGTGGGACCTTCCCCATTAArGCCTACACC\CGGGCCCCTGTACCCCCCT 
CTTGTACACCTCACCCiCGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGA 

Overlap with 25c 

P roAl aP roAsnTyrThr P heAlaLeuTrpArgValS er AlaG luGluTyrValGluIle 
6 1 TCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATACGTGGAGAT 
AGGACGCGGCTTGAIGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATGCACCTCTA 



ArgGlnValGlyAspPheHisTTTValThxGlyKetTlvrthrAspAsnLeuLysCysPro 
121 AAGGCAGGTGGGGG ACTTCCACTACGTGACGGC XATG ACTACTGAGAATCrTAAATGCCC 
TTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAATTTACGGG 



CysGlnValProSerProGluPhePhe1.'_31uLeuAspGl7ValArgLeuHlsArgPhe 
1 a 1 GTGCCAGGTCCCATCGCCCGAATTTTTCACAG AATTGG ACGGGGTGCGCCTACATAGGTT 
CACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGGATGTATCCAA 

AlaProProCysLysProLeuLeuAr^GluGluValSerPheArgVeLlGlyLeuHisGlu 
241 TGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAGGACTCCACG A 
ACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATCCTGAGGTGCT 

TyrProValGlySerGlaLeuProCysGluProGluProAspValAlaValLeuThrSer 
301 ATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTC 
TATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGCACAACTGCAG 

MetLeuThrAspProSerHisIleThrAlaGluAlaAlaGlyArgArgLeuAlaArgGly 
361 CATGCTCACTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGG 
GTACGAGTGACTAGGGAGGG7ATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCC 

SexProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSexLeuLysAla 
4 21 ATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGC 
TAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCG 

ThrCysThrAlaAsnHisAsoSerProAso 
451 AACTTGCACCGCTAACCATGACTCCCCTGAT 
TTGAACGTGGCGATTGGTACTGAGGGGACTA 



~r u J 10 ^10 Ml 



FIG. 19 Translation of jjha 8f 

Overlap with 14c — 

SerSerSerAlaSerGlnLeuSerAlaProSerLeuLysAlaThrCysThrAlaAsnHis 
1 AGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCAT 
TCGA5GAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTCGTA 



AspSerProAspAlaGluLeuIleGluAlaAsnLeuLeuTrpAr7GlnGluMetGlyGl7 
61 GACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCAGGAGATGGGCGGC 
CTGAGGGGACTAC6ACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCG 

AsnlleThrArgValGluSerGluAsnLyaValVallleLeuAspSerPheAspProLeu 
121 AACATCACCAG^kiTTGAGTCAGAAAAOUUVGTGGTGATTCTGGACrCCTTC^^ 

TTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAA 

ValAlaGluGluAspGluArgGlulleSerValProAlaGluIleLeuArgLysSerArg' 
181 GTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATCCTGCGGAAGTCTCGG 
O^CCGCCTCCTCCTGCTCGCCCTCTAGAGGCAXGGGCGTCTrEAGGACGCCnTCA^ 

ArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsaProProLeuValGlu 
241 AGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAG 
TCTAAGCGGGTCCGGGACGGGCAAACCwGCGCCGGCCTGATATTGGGGGGCGATCACCTC^ 

ThrTrpLysLysProAspTyrGluProProVaivalHlsGlyCysProLeufroProPro 
301 ACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCA • 
TGCACCTTTTTCGGGCTGATGCT I'w iTGGACACCAGGTACCGACAGGCGAAGGTGGAGGT 

LysSerProProValPro 
361 AAGTCCCCTCCTGTGCCG 
TTCAGGGGAGGACACGGC 



FIG. 20 Translation of DMA 33 f 



ValTrpAlaArgProAspTyrAsnProProLeuValGluTlirTrpLysLysPrcAspTyT 
1 CGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAACCCGACTA 
GCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCrcCACCTTTTTTGGGCTGAT 

Overlap with 8f 

GluProProValValHisGlyCysProLeuProProProLysSerProProValProPro 
61 CGAACCACCTGTGGTCCATGGCTGCCCGCTTCCJVCCTCCAAAGTCCCCTCCTGTGCCTCC 
GCTTGGTGGACACCAGGTACCGACGGGCGAAGGIGGAGGTTTCAGGGGAGGACACGOAGG 

ProArgLysLysArgThrValValLeuThrGluSerThrLeuSerThrAlaLcuAlaGlu 
121 GCCTCGGAAGAAGCQSACGGTGGTCCnXJVCTGAATCAACCCTATCTACrGCCTTGGCCGA 
CGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCT 

LeuAlaThrArgSerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrThr 
181 GCTCGCCACG^GyVAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGAC;^ 

CGAGCGGTGGTCrrCGAAACCGTCGAGGAGTriXyU^GGCCGTAATGCCCGCTGTTATGCTG 

ThrSerSerGluProAlaProSerGlyCysProProAspSerAspAlaGluSerPhe 
241 AACATCCTCTGAGCCCGCCCCTTCIX;GCTGCCCCCCCG ACrCCGACGCTGAGTCCTTTGC 
TTGTAGGAGAC7CGGGCG<;GGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGAAACG 



WA. J-)*;/ 



, AlaS^ArgSerPheGlySerSerSerThrSftrClyiieThrGlyAspAsnThrThrThr. 
CGGAGGTCTTCCyAACCGTCGAGGAGrreAAGGCCGTAATCCCCGCTCm 
Overlap vith 33 f-— — 

TACGGGGGGGACCTCCCCCTCG<:;XCCCTAGGCCTAGAATcS^^ 

ValSerSerGluAlaAsnAlaGluAspValValC^sCTsSeiMetScrtTrSerTrBTh*- 

^TCMTAGTG^^CCAACGCGGAGGATGTCGTGTG^ 

CAGTCATCACTCCGGTTCCGCCTCCTACAGCACACGACGAGTTAC^ 



241 




CCGCGTGAGCAGTGGGGOkCGCGGCGCCTTCrTGTCTrTGA^ 

AsnSerLeuLeuArgHlsHlsAanLeuValTyrSerrhrThrSerArgSer 
301 AACTCGTTGCTACGTCACCACaATTTCGTGTATTCCACCACCTCACGCAGTC 
ITGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTCCGTCAC 

FIG. 22 Translation of DNA 7f 

G lyThrTy rVa iTy r AsnH i sLeuThr ProLeuAr gAs pTrpAlaH i s As nG 1 vLeuArg 
1 GGCACCTATGTTTATAACCATCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGA 
CCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCT 

AspLeuAlaValAlaValGluProValValPheSerGlnMetGluThrLysLeuIleThr 
6 1 GATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTCATCACG 
CTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGC 

TrpGlyAlaAspThrAlaAlaCysGlyAspIlelleAsnGlyLeuProValSerAlaArg 
121 TGGGGGGCAG ATACCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCCGC 
ACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGGCG 



ArgGlyArgGluIleLeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeu 
181 AGGGGCCGGG AGATACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGTTGG AGGTTG 
TCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCAACCTCCAAC 

LeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysIlelleThr 
24 1 CTGGCGCCCATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCACC 
GACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGTGG 

Overlap with 7e 

SerLeuThrGlyArgAspLysAsnGlnValGluGlyGluValGlnlleValSerThrAla 
301 AGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTGCT 
TCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGACGA 



361 



AlaGlnThrPheLeuAlaThrCysIleAsnGlyValCysTrp 
GCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGG 
CGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACC 



FIG. 23 Translation of DNA lib 

GlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyr 
1 GGCGGTGTTGTTCTCGTCGGGTTGATGGC^CTGACTCTGTCACCATATTACAAGCGCTAT 
CCGCCkJAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATA 

IleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHis 
61 ATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCAC 
TAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTG 

ValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeuMetCys 
121 GTGTGG ATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGT 
CACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACA 

AlaValKisProThrLeuVaiPheAspIleThrLysLeuLeuLeuAlaValPheGlyPro 
1 S 1 GCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGCCGTCTTCGGACCC 
CGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGG 

LeuTrpIleLeuGlnAlaSerLeuLeuLysValProTyrPheValArgValGlnGlyLeu 
241 CTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTT 
GAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAA 

LeuArgPheCysAlaLeuAlaArgLysMetlleGlyGlyHisTyrValGlnMetVallle 
301 CTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAATGGTCATC 
GAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAG 



IleLysLeuGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeuArgAsp 
361 ATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTCTTCGGGAC 
TAATTCAATCCCCGCGAATGACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTG 

Overlap with 7f 

TrpAlaHisAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPheSerGln 
421 TGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCCAA 
ACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTT 



481 



MetGluThrLysLeuIleThrTrpGly 

ATGGAGACCAAGCTCATCACGTGGGGGGC 

TACCTCTGGTTCGAGTAGTGCACCCCCCG 



FIG. 24 Translation of DNA 14 i 

GluTyryalVaiLeuLeuPheLeuLeuLeuAlaAspAlaArgValCysSerCysLeuTrp 
1 GGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMetLeuLeuIleSerGlnAlaGluAlaAlaLeuGluAsnLeuVallleLeuAsnAla 
61 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTrAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGqCGAAACCTCTTGGAGCATTATGAATTAC 

AlaSerLeuAlaGlyThrHisGlyLeuValSerPheLeuValPhePheCysPheAlaTrp 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeu 
181 GGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGG ATGTGGCCTC 
CCATAAACTTC^rATTCACCCACGGGCCTCGCCAGA'.GTGGAAGATGCCCTACACCGGAG 

LeuLeuLeuLeuLeuAlaLeuProGlriArgAlaTyrAlaLeuAspThrGluValAlaAla 
241 TCCTCCTGCTCCTG".TGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 

Overlap with lib 

SezrCysGlyGlyValValLeuValGlyLeuMetAlaLeuThrLeuSerProTyrTyrLys 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 



ArgTyrlleSerTrpCysLeuTrpTrpLeuGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGAA 
TCGCGATATAGTCGACCACGAACArC ' CCGAAGTCTT 

FIG. 25 Translation of DNA 39c 



ProAlaProSer<31yCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
1 CCAGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTArrCCTCCATGCCCCCC 
GGTCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGG 

LeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSer 
61 CTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACAGTCAGTAGT 
GACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGTCAGTCATCA 

Overlap with 33g 

GluAlaAsnAlaGluAspValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeu 
121 GAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCCTACTCTTGG ACAGGCGCACTC 
CTCCGGT13CGCCTCCTACAGCACACGACGAGTTACAGGATGAGAACCTGTCCGCGTGAG 



ValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeu 
181 GTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTGAGCAACTCGTTG 
CAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTACGTGACTCGTTGAGCAAC 



LeuArgHisHisAsfiLeuValTyrSerThrThrSerArgSerAlaCysGlnArgGlnLys 
241 CTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAG AAG 
GATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTC 

LysValThrPheAspArgLeuGlnValLeuAspSerHisTyrGlnAspValLeuLysGlu 
301 AAAGTCACATTTQACAGACTGCAAGTTCTGGACAGCCATTACCAGG ACGTACTC AAGG AG 
TTTCAGTGTAA^CTGtCTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCT^ 

ValLysAlaAlaAlaSerLysValLysAlaAsnPhe 
361 GTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTC 
CAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAAG 



FIG. 26-1 COMBINED ORF OF DNAs 
l4i/llb/7f/7e/8h/33c/40b/37b/35/36/81/3 2/33b/25c/l4c/8f/3 3f/33g/39c 



GluTyrValValLeuLeuPheI>*uLeuLeuAlaAspAlaArgValCysSerCysLeuTrp 
1 GGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGT 
CCCTCATGCAGCAAGAGGACAAGGAAGACGAACGTCTGCGCGCGCAGACGAGGACGAACA 

MetMeELeuLeuIleSerClnAlaGluAlaAlaLeuGluAsnLeuVallleLeuAsnAla 
61 GGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGAACCTCGTAATACTTAATG 
CCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCTTGGAGCATTATGAATTAC 

AlaSerLeiiAlaGlyThrHisGlyLeuValSerPheLeuValPhePheCysPheAlaTrp 
121 CAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCGTGTTCTTCTGCTTTGCAT 
GTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGCACAAGAAGACGAAACGTA 

TyrLeuLysGlyLysTrpValProGlyAlaValTyrThrPheTyrGlyMetTrpProLeu 
181 GGTATTTGAAGGGTAAGTGGGTGCCCGGAGCGGTCTACACCTTCTACGGGATGTGGCCTC 
CCATAAACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGAAGATGCCCTACACCGGAG 

I^uLeoLeiil^uI^uAlal^uProGlnArgAlaTyrAlaLeuAspThxGluValAl^^ 
241 TCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGCTGGACACGGAGGTGGCCG 
AGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCGACCTGTGCCTCCACCGGC 

SertysGlyGlyValValLeuValGlyLeuMetJUaLeuThrLeuSerProTyrTyrLys 
301 CGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGr.rGGCGCTGACTCTGTCACCATATTACA 
GCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGACTGAGACAGTGGTATAATGT 

ArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeuThrArgValGluAlaGln 
361 AGCGCTATATCAGCTGGTGCTTGTGGTG^C^TCAGTATTTTCTGACCAGAGTGG AAGCGC 
TCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAGACTGGTCTCACCTTCGCG 

LeuHisValTrpIleProProLeuAsnValArgGlyGlyArgAspAlaVallleLeuLeu 
4 21 AACTGCACGTGTGG ATTCCCCCCCTCAACGTCCGAGGGGGGCGCGACGCCGTCATCTTAC 
TTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATG 

MetCysAlaValHisProThrLeuValPheAspIleThrLysLeuLeuLeuAlaValPhe 
481 TCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCAAATTGCTGCTGGC 

AGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGTTTAACGACGACCGGCAGA 

GlyProLeuTrpIleLeuGlnAlaSerLeuLeuLysValProTyrPheValArgValGln 
541 TCGGACCCCTTTGG ATTCTTCAAGCCAGTTTGCTTAAAGTACCCTACTTTC 

AGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATGGGATGAAACACGCGCAGG 

GlyLeuLeuArgPheCysAlaLexiAlaArgLysMetlleGlyGlyHisTyrValGlnMet 
601 AAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCGGAGGCCATTACGTGCAAA 
TTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGCCTCCGGTAATGCACGTTT 

ValllelleLysI^uGlyAlaLeuThrGlyThrTyrValTyrAsnHisLeuThrProLeu 
661 TGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTTATAACCATCTCACTCCTC 
ACCAGTAGTAATTCAATCCCCGCGAAXGACCGTGGATACAAATATTGGTAGAGTGAGGAG 

ArgAspTrpAlaHisAsnGlyLeuArgAspLeuAlaValAlaValGluProValValPhe 
721 TTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCT 
AAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACCGACATCTCGGTCAGCAGA 

SerGlnMetGluThrLysI^uIleThrTrpGlyAlaAspThrAlaAlaCysGlyAspIle 
781 TCTCCCAAATGGAG ACCAAGCTCATCACGTGGGGGGCAG ATACCGCCGCGTGCGGTGACA 
AGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTATGGCGGCGCACGCCACTGT 

IleAsnGlyLeuProValSerAlaArgArgGlyArgGluIleLeuLeuGlyProAlaAsp 
841 TCATCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGATACTGCTCGGGCCAGCCG 
AGTAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCTATGACGAGCCCGGTCGGC 

GlyMetValSerLysGlyTrpArgLeuLeuAlaProIleThrAlaTyrAlaGlnGlnThr 



901 ATGGAATGCTCTCCAAGGGGTGGAGGTT^CTGGCGCCCATCACGGCGTACGCCCAGCAGA 
TACCTTACCAGAGGTTCCCCA XTCCAACGACCGCGGGTAGTGCCCCATGCGGGTCGTCT 

ArgGlyLeuLeuGlyCysIlelleThrSerLeuThrGlyArgAspLysAsnGlnValGlu 
961 C AAGGGGCCTCCTAGGGTGCATAATCACCAGCCTAACTGGCCGGG ACAAAAACCAAGTGG 
GTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGGCCCTGTTTTTGGTTCACC 

GlyGluValGlnlleValSerThrAlaAlaGlnThrPheLeuAlaThrCysIleAsnGly 
1021 AGGGTGAGGTCCAGATTGTGTCAACTGCTGCCCAAACCTTCCTGGCAACGTGCATCAATG 
_ TCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGGACCGTTGCACGTAGTTAC 

ValCysTrpThrValTyrHisGlyAlaGlyThrArgThJTlleAlaSerProLysGlyPro 
1031 GGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCATCGCGTCACCCAAGGGTC 
CCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTG3TAGCGCAGTGGGTTCCCAG 

VallleGlnMetTyrThrAsnValAspGlnAspLeuValGlyTrpProAlaProGlnGIy 
1141 CTGTCATCCAGATGTATACCAATGTAGACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAG 
GACAGTAGGTCTACATATGGTTACJ' TCTGC rXCTGG AACACC CG AC CGGGCGAGGCGTTC 

SerArgSerLeuThrProCysTlirCysGlySerSerAspLeuTyrLeuValThrArgHis 
1201 GTAGCCGCTCATTG ACACCCTGCACTTGCGGCTCCTCGG ACCTTTACCTGGTCACG AGGC 
. CATCGGCGAGTAACTGTGGGACGTGAACGCCGAGGAGCCTGGAAATGGACCAGTGCTCCG 

AlaAspVallleProValArgArgArgGlyAspSerArgGlySerLeuLeuSerProArg 
1261 ACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGGGCAGCCTGCTGTCGCCCC 
TGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCCCGTCGGACGACAGCGGGG 

ProIleSerTyrLeuLysGl:SerSer<;iyGlyProLeuLeuCysProAlaGiyHisAla 
1321 GGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGTTGTGCCCCGCGGGGCACG 
CCGGGTAAAGGATGAACTT7CCGAGGAGCCCCCCAGGCGACAACACGGGGCGCCCCGTGC 

ValGlyllePheArgAl^MaValCysThrArgGlyValAlaLysAlaValAspPhelle 
1381 CCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGGCTAAGGCGGTGGACTTTA 
GGCACCCGTATAAATCCCGGCGCCACACGTGGGCACCTCACCGATTCCGCCACCTGAAAT 

ProValGluAsnLeuGluThrThrMetArgSerProValPheThrAspAsnSerSerPro 
1441 TCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGTTCACGGATAACTCCTCTC 
AGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACAAGTGCCTATTGAGGAGAG 

ProValValProGlnSerPheGlnValAlaHisLeuHisAlaProThrGlySerClyLys 
1501 CACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATGCTCCCACAGGCAGCGGCA 
GTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGT 

SerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLysVaLLeuValLeuAsnPro 
1561 AAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATAAGGTGCTAGTACTCAACC 
TTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATATTCCACGATCATGAGTTGG 

SerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLysAlaHisGlylleAspPro 
1621 CCTCTGTTGCTGCAACACTCGGCTTTGGTGCTTACATGTCCAAGGCTCATC 

GGAGACAACGACGTTGTGACCCGAAACCACGAATGTACAGGTTCCGAGTACCCTAGCTAG 

AsnlleAxgThrGlyValArgThrlleThrThrClySerProIleThrTyrSerThrTyr 
1681 CTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCCCCATCACGTACTCCACCT 
GATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGGGGTAGTGCATGAGGTG^ 

GlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyrAspIlellelleCysAsp 
1741 ACGGCAAGTTCCTTGCCG ACGGCGGGTGCTCGGGGGGCGCTTATGACATAATAATTTGTG 
TGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAATACTGTATTATTAAACAC 

GluCysHisSerThrAspAlaThrSerlleLeuGlylleGlyThrValLeuAspGlnAla 
1801 ACGAGTGCCACTCCACGGATGCCACATCCATCTTGGGCATCGGCACTGTCCTTGACCAAG 
TGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGCCGTGACAGGAACTGGTTC 

GluThrAlaGlyAlaArgLeuValValLeuAlaThrAlaThrProProGlySerValThr 
1861 CAG AGACTGCGGGGGCGAGACTGGTTG7GCTCGCCACCGCCACCCCTCCGGGCTCCGTCA 
GTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGTGGGGAGGCCCGAGGCAGT 
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ValProHisProAsnlleGluGluVrlAlaLeuSerThrvhrGlyGluIleProPheTyr 
1921 CTGTGCCCCATCCCAACATCG AGGAGGTTGCTCTGTCCACCACCGG AG AG ATCCCTTTT? 
GACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGTGGCCTCTCTAGGGAAAA^\ 

GlyLysAlalleProLeuGluVairleLysGlyGlyAjrgHisLeuIlePheCysHisSer 
1981 ACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGG AGACATCTCATCTTCTGTCATT 
TGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAA 

Lj;sLysLysCysAspGluLeuAlaAiaLysLeuValAlaLeuGlyIleAsnAlaValAla 
2041 CAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCATTGGGCATCAATGCCGTGG 
GTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTAACCCGTAGTTACGGCACC 

TyrTyrArgGlyLeuAspValSerVallleProThrSerGlyAspValValValValAla 
2101 CCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCGGCGATGTTGTCGTCGTGG 
GGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGCCGCTACAACAGCAGCACC 

ThrAspAlaLeuMetThrGlyTyrThrGlyAspP'-eAspSerVallleAspCysAsnThr 
2161 CAACCGATGCCcrCATGACCGGCTATACCGGCGACTTCGACTCGGTGATAGACTGCAATA 
GTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTGAGCCACTATCTGACGTTAT. 

CysValThrGlnThrValAspPheSerLeuAspProThrPheThrlleGluThrlleXhr 
2221 CGTGTGTCACCCAG ACAGTCG ATTTCAGCCTTGACCCTACCTTCACCATTG AG ACAATCA 
GCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGAAGTGGTAACTCTGTTAGT 

LeuProGlnAspAlaValSerArgThrGlnArgArgGlyArgThrGlyArgGlyLysPro 
2281 CGCTCCCCCAGGATGCTGTCTCCCGCACTCAACGTCGGGGCAGGACTGGCAGGGGGAAGC 
GCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCG 

GlylleTyrArgPheValAlaPrcGlyGluArgProSerGlyMetPheAspSerSerVal 
2341 C AGGCATCTACAG ATTTGTGGCACCGGGGG AGCGCCCCTCCGGCATGTTCG ACTCGTCCG 
GTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGCLGTACAAGCTGAGCAGGC 

LeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeuThrProAlauluThrThr 
2401 TCCTCTGTGAGTGCTATGACGCAGGCTGTGCTTGGTATGAGCTCACGCCCGCCGAv'ACTA 
AGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCGAGTGCGGGCGGCTCTGAT 

ValArgLeuArgAlaTyrMetAsnThrProGlyLeuProValCysGlnAspHisLe.iGlu 
24 61 CAGTTAGGCTACG AGCGTACATGAACACCCCGGGGCTTCCCGTGTGCCAGG ACC ATCr.TC 
GTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGCACACGGTCCTGGTAGAiVC 

PheT rpGluGlyValPheTlirGlyLeuThrHlsIleAspAlaHisPheLeuSer<51nThr 

2521 aattttgggagggcgtctttacaggcctcactcatatagatgcccactttctatcccag a 
ttaaaaccctcccgcagaaatgtccggagtgagtatatctacgggtgaaagatagggtv:t 

LysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGlnAlaThrValCysAlaArg 
2581 caaagcagagtggggagaaccttccttacctggtagcgtaccaagccaccgtgtgcgcta 
gtttcgtctcacccctcttggaaggaatggaccatcgcatggttcggtggcacacg 

AlaGlnAlaProProProSerTrpAspGlnMetTrpLysCysLeuIleArgLeuLysPro 
2641 GGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGG AAGTGTTTGATTCGCCTCAAGC 
CCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAGTTCG 

ThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGluIleThr 
2701 CCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCGCTGTTCAG AATG AAATCA 
GGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACTTTAGT 

LeuXhrHisProValThrLysTyrllcMetThrCysMetSerAlaAspLeuGluValVal 
2761 CCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGG AGGTCG 
GGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTCCAGC 

ThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeuAlaAlaTyrCysLeuSer 
2821 TCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGT 
AGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACGGACA 

ThrGlyCysValVallleValGlyArgVaiValLeuSerGlyLysProAlallellePro 
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2881 CAACAGGCTGCGTGGTCATAGTGGGCAGGGTCG rCTTGTCCGGGAAGCCGGCAATCATAC 
GTTGTCCGACGCACCAGTATCACCCGrCCCAGCAGAACAGGCCCTTCGGCCGTTAGTATG 

AspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHisLeuPro 
2941 CTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCACTTAC 
GACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTCTCACGAGAGTCGTGAATG 

TyrlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGlyLeuLeu 
3001 CGTACAtCGAGCAAGGGATGATGCTCGCCG AGCAGTTCAAGCAGAAGGCCCTCGGCCTCC 
GCRTGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGG 

GlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrpGlnLys 
3061 TGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCCAGACCAACTGGCAAA 
ACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGTTGACCGTTT 

LeuGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyrLeuAla 
3121 AACTCG AGACCTTCTGGGCGAAGCATATGTGGA \CTTCATCAGTGGGATACAATACTTGG 
TTGAGCTCTC^AAGACCCGCTTCGTATACACCTTiiAAGTAGTCACCCTATGTTATGAACC 

GlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeuMetAlaPheThrAlaAla 
3181 CGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTCATTGATGGCTTTTACAGCTG 
GCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGTCGAC 

ValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGlyTrpVal 
3 241 CTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACAT ATTGGGGGGGTGGG 
GACAGTGGTCGGGTGATTGGTGATCGGTTTGGGAGGAGAAGTTGTATAACCCCCCCACCC 

AlaAlaGlnLeuAlaAlaProGl>;^aAlaThrAlaPheValGlyAlaGlyLeuAlaGly 
3301 TGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTG 
ACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAATCGAC 

AlaAlalleGlySerValGl-'I ' uGlyLysValLeuIleAspIleLeuAlaGlyTyrGly 
3361 GCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAGACATCCTTGCAGGGTATG 
CGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCCATAC 

AlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSerGlyGluValProSerThr 
34 21 GCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGAGCGGTGAGGTCCCCTCCA 
CGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGGAGGT 

GluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuValValGlyVal 
3481 CGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTCGGCG 
GCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAGCCGC 

ValCysAlaAlalleLeuArgArgHisValGlyProGlyGIuGlyAlaValGlnTrpMet 
3541 TGGTCTGTGCAGCAATACrcCGCCGGCACGTTGGCCCGGGCGAGGGGGCAGTGCA^ 

ACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTCACGTCACCT 

AsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSerProThrHisTyrValPro 
3601 TGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCCACGCACTACGTGC 
ACTTCGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGGTGCGTGATGCACG 

GluSerAspAlaAlaAlaArgValThrAlalleLeuSerSerLeuThrValThrGlnLeu 
3661 CGGAG AGCGATGCAGCTGCCCGCGTCACTGCCATACTC AGCAGCCTCACTGTAACCCAGC 
GCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGTCGGAGTGACATTGGGTCG 

LeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThrProCysSerGlySerTrp 
3721 TCCTGAGGCGACTGCACCAGTGGATAAGCTCGG AGTGTACCACTCCATGCTCCGGTTCCT 
AGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGTGAGGTACGAGGCCAAGGA 

LeuAxgAspIleTrpAspTrpIleCysGluValLeuSerAspPheLysThrTrpLeuLys 
3781 GGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCGACTTTAAGACCTGGCTAA 
CCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGCTGAAATTCTGGACCGATT 

AlaLysLeuMetProGlnLeuProGlylleProPheValSerCysGlnArgGlyTyrLys 
3 841 AAGCTAAGCTCATGCCACAGCTGCCTGGGATCCCC7TTGTGTCCTGCCAGCGCGGGTATA 
TTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACAGGACGGTCGCGCCCATAT 

FIG. 26-4 



GlyVairrpArgValAspGly.leMetHisThrArqCysHlsCysGlyAlaGluIleThr 
3 901 AGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCCACTGTGGAGCTGAGATCA 
TCCCCCAGACC'GCTCACCTGCCGTAGTACGTGTGAGCGACGGTGACACCTCGACTCTAGT 

GlyHisValLysAsnGlyThrMetArglleValGlyProArgThrCysArgAsnMetTrp 

3 961 CTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTAGGACCTGCAGGAACATGT 

GACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGATCCTGGACGTCCTTGTACA 

SerGlyThrPheProIleAsnAiaTyrThrThrGlyProCysThrProLeuProAIaPro 

4 021 GG AGTGGG ACCTTCCCCATTAATGCCTACACCACGGGCCCCTGTACCCCCCTTCCTGCGC 

CCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGACATGGGGGGAAGGACGCG 

AsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluT-yrValGluIleArgGlnVal 
4 081 CG AACTACACGTTCGCGCTATGGAGGGTGTCTGCAGAGGAATATGTGGAGATAAGGCAGG 
GCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTATACACCTCTATTCCGTCC 

GlyAs'^PheHisTyrValThrGlyMctThirhrAspAsnLeuLysCysProCysGlnVal 
4141 TGGGGG ACTTCCACTACGTGACGGGTATGACTACTGACAATCTCAAATGCCCGTGCCAGG 
ACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAGAGTTTACGGGCACGGTCC 

ProSerProGluPhePheThxGluLeuAspGlyValArgLeuHisAxgPheAlaProPro 
4 201 TCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCCTACATAGGTTTGCGCCCC . 
AGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGCATGTATCCAAACGCGGGG 

CysLysProLeuLeuArgGluGluValSerPheArgValGlyLeuHisGluTyrProVai 
4 261 CCTGCAAGCCCTTGCTGCGGGAGG AGGTATCATTCAGAGTAGGACTCCACG AATACCCGG 
GGACGTTCCGGAACGACGCCCTCCTOCATAGTAAGTCTCATCCTGAGGTGCTTATGGGCC 

GlySerGlnLeuProCysGluProGluProAspValAlaValLeuThrSerMetLeuThr 
4 321 TAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCGTGTTGACGTCCATGCTCA 
ATCCCAGCGTTAATGGAACG':T''GGGCTTGGCCTGCACCGGCACAACTGCAGGTACGAGT 

AspProSerHisIleThrAlaGluAlaAlaGlyArgArgLeuAlaArgGlySerProPro 
43 81 CTGATCCCTCCCATATAACAGCAGAGGCGGCCGGGCGAAGGTTGGCGAGGGGATCACCCC 
GACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGG 

SerValAlaSerSerSerAlaSerGlnLeioSerAlaProSerLeuLysAlaThrCysThr 
4441 CCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCATCTCTCAAGGCAACTTGCA 
GGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTAGAGAGTTCCGTTGAACGT 

Al5LAsnHisAspSerProAspAlaGluLeuIleGluAlaAsnLeuLeuTrpArgGlnGlu 
4501 CCGCTAACCATGACTCCCCTGATGCTGAGCTCATAGAGGCCAACCTCCTATGGAGGCAGG 
GGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGTTGGAGGATACCTCCGTCC 

MetGlyGlyAsnlleThrArgValGluSerGluAsnLysValVallleLeuAspSerPhe 
4561 AGATGGGCGGCAACATCACCAGGGTTGAGTCAG AAAACAAAGTGGTGATTCTGG ACTCC 
TCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTCACCACTA^^ 

AspProLeuValAlr-GluGluAspGluArgGluIleSerValProAlaGluIleLeuArg 
4621 TCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCGTACCCGCAGAAATCCTGC 
AGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGCATGGGCGTCTTTAGGACG 

LysSerArgArgPheAlaGlnAlaLeuProValTrpAlaArgProAspTyrAsnProPro 
4 681 GGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGCGGCCGGACTATAACCCCC 
CCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCGCCGGCCTGATATTGGGGG 

LeuValGluThrTrpLysLysProAspTyxGluProProValValHlsGlyCysProLeu 
4741 CGCTAGTGG AGACGTGGAAAAAGCCCGACTACGAACCACCTGTGGTCCATGGCTGTCCGC 
GCGATCACCTCTGCACCTTTTTCGGGCTGATGCTTGGTGGACACCAGGTACCGAC^^^ 

ProProProLysSerProProValProProProArgLysLysArgThrValValLeuThr 
4 801 TTCCACCTCCAAAGTCCCCTCCTGTGCCTCCGCCTCGG AAGAAGCGGACGGTGGTCCTCA 
AAGGTGGAGGTTTCAGGGGAGGACACGGAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGT 

GluSerThrLeuSerThrAlaLeuAlaGluLeuAlaThrArg-SerPheGlySerSerSer 

FIG. 26-5 



4 861 CTG AATCAACCCTATCTACTGCCTTGGCCG AGCTCGCC ACCAG AAGCTTTGGCAGCTCCT 
GACTTAGTTGGGATAGATCACGGAACCGGCTCGAGCGGTGGTCTTCGAAACCCTrGAGGA 

ThrSerGlylleThrGlyAspAsnThrThrThrSerSerGluProAlaProSerGiyCys 
4 921 CAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTG AGCCCGCCCCTTCTG CT 
GTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGACTCGGGCGGGGAAGACCGA 

ProProAspSerAspAlaGluSerTyrSerSerMetProProLeuGluGlyGIuProGly 
4 981 GCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCCCCCTGG AGGGGG AGCCTG 
CGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGGGGGACCTCCCCCTCGGAC 

AspProAspLeuSerAspGlySerTrpSerThrValSerSerGluAlaAsnAlaGloAsp 
5041 GGGATCCGGATCTTAGCGACCGJTCATGGTCAACGGTCAGTAGTGAGGCCAACGCGGAGG 
CCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCaLGTCATCACTCCGGTTGCGCCTCC 

ValValCysCysSerMetSerTyrSerTrpThrGlyAlaLeuValThrProCysAlaAla 
5101 ATGTCGTGTGCTGCTCAA.?^TCTTACTCTTGG ACAGGCGCACTCGTCACCCCGTGCGCCG 
TACAGCACACGACGAGTTACAGAATGAGAACCTGTCCGCGTGAGCAGTGGGGCACGCGGC 

GluGluGlnLysLeuProIleAsnAlaLeuSerAsnSerLeuLeuArgHisHisAsnLeu 
5161 CGGAAGAACAGAAACTGCCCATCAATGCACTAAGCAACTCGTTGCTACGTCACC ACAATT 
GCCTTCTTGTCTTTGACGGGTAGTTACGTGATTCGTTGAGCAACGATGCAGTGGTGTTAA 

ValTyrSerThrXhrSerArgSerAlaCysGlnArgGlnLysLysValThrPheAspArg 
5221 TGGTGTATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGAAGAAAGTCACATTTGACA 
ACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCTTCTTTCAGTGTAAACTGT 

LeuGlnValLeuAspSerHisTyrGlnAspValLeuLysGluValLysAlaAlaAlaSer 
5281 GACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGGAGGTTAAAGCAGCGGCGT 
CTGACGTTCAAGACCTGTCGGTAATGGTCCTGCATGAGTTCCTCCAATTTCGTCGCCGCA 

LysValLysAlaAsnLeu 
5341 CAAAAGTGAAGGCTAACTTG 
GTTTTCACTTCCGATTGAAC 



FIG. 26-6 



FIG, 27 Translation of DNA 12f 

IlePheLysIleArgMetTyrValGlyGlyValGluVisArgLeuGluAlaAlaCysAsn 
i CCATATTTAAAATCACo-ATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA 
GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGT 

TrpThrArgGlyGluArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeu 
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGT 
TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA 

LeuLeuThrThrThrGlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeu 
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCT 
ATGACGACTGGTGATGTGTCACCGTCCAGGACGGCACAAGGAAGTGTTGGGATGGTCGGA 

SerThrGlyLeuIleHisLeuHisGlnAsnlleValAspValGlnTyrLeuTyrGlyVal 
181 TGTCCACCGGCCTCATCCACCTCCACCAGAACATTGTGGACGTGCAGTACTTGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGTGCT-, .TGTAACACCTGCACGTCATGAACATGCCCC 



GlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValValLeuLeuPheLeuLeu 
241 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTC 
ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAA.G 



LeuAlaAspAlaArgValCysSerCysLeuTrpMetMetLeuLeuIleSerGlnAlaGlu 
301 TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGG 
ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

Overlap with 14i 

AlaAlaLeuGluAsnLeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu 
361 AGGCGGCTTTGGAG AACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 



Val 

4 21 TTGTATC 
AACATAG 



FIG. 28 Translation of DNA 35 f 



Overlap with 39c 

LeoL/sGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerValGluGlu 
1 TGCTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCGTAGAGG 
ACGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGCATCTCC 

AlaCysSerLeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAsp 
61 AAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAG 
TTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTC 

ValArgCysHisAlaArgLysAlaValThrHisIleAsnSerValTroLysAspLeuLeu 
121 ACGTCCGTTGCCATGCCAGAA;i.s,GCCGTAACCCACATCAACTCCGTGTGGAAAG ACCTTC 
TGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAG 

GluAsDAsnValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysVal 
181 TGG AAG ACAATGTAACP ATAG ACACTACCATCATGGCTAAG AACG AGGTTTTCTGCG 
ACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGC 

GlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeuGlyVal 
241 TTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCC 
AAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGC 

ArgValCysGluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaValMet 
301 TGCGCGTGTGCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGA 
ACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGTTTC3AGGGGAACCGGCACT 

GlySerSerTyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAla 
361 TGGGAAGCTCCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAG 
ACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTC 

TrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThr 
^21 CGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATG ATACCCGCTGCTTTG ACTCCA 
GCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGT 

ValThrGluSerAsDileArgThrGluGluAla 
481 CAGTCACTGAGAGCGACATCCGTACGGAGG AGGCA 
GTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGT 



EP 0 318 216 Al 



FIG. 29 Translation of DNA 19g 



GluPheLeuValGlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThr 
1 GAATTCCTCGTGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACC 
CTTAAGGAGCACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGG 

Overlap with 35f 

ArgCysPheAspSerThrValThrCluSerAspIleArgThrGluGluAlalleTyrQln 
61 CGCTGCTTTGACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAA 
GCGACGAAACTGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTT 

CysCysAspLeuAspProG] nl" ^ aArgValAlalleLysSerLeuThrGluArgLeuTyr 
121 TGTTGTGACCTCGACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTAT 
ACAACACfGGAGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATA 

ValGlyGlyProLeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAla 
181 GTTGGGGGCCCTCTTACCAATTCAAGGGGGG AG AACTGCGGCTATCGCAGGTGCCGCGCG 
CAACCCCCGGGAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGC 

SerGlyValLeuThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAla 
241 AGCGGCGTACTGACAACTAGCTGTGGTAAC ACCCTCACTTGCTACATCAAGGCCCGGGCA 
TCGCCGCATGACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGT 

AlaCysArgAlaAlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuVal 
301 GCCTGTCGAGCCGCAGGGCTCCAGG ACTGCACCATGCTCGTGTGTGGCGACGACTTAGTC 
CGGACAGCTCGGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATCAG 

ValHeCysGluSerAlaGlyValGlnGluAspAlaAla 
361 GTTATCTGTGAAAGCGCGGGGGTCCAGGAGGACGCGGCGAG 
CAATAGACACTTTCGCGCCCCCAGGTCCTCCTGCGCCGCTC 



FIG. 30 Trani;Iation of DNA 26q 



GlyGLyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyVaiLeuThrThrSerCys 
I GGGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCAAGCGGCGTACTGACAACTAGCTGT 
CCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGTTCGCCGCATGACTGTTGATCGACA 

GlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGln 
61 GGTAACACCCTCACTTGTTACATCAAGGCCCGAGCAGCCTGTCGAGCCGCAGGGCTCCAG 
CCATTGTGGGAGTGAACAATGTAGTTCCGGGCTCGTCGGACAGCTCGGCGTCCCGAGGTC 

Overlap with 19g 

AspCysThrMetLeuValCysGlyAspAspLeuValVallleCysGluSerAlaGLyVal 
121 GACTGCACCATGCTCGTGTGTGGCGACGACTTAG'^CGTTATCTGTG AAAGCGCGGGGGTC 
CTGACGTGGTA^GAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAG 

GlnGluAspAiaAlaSerLeuArgAlaPheThrGluAlaiMetThrArgTyrSerAlaHro 
181 CAGG AGGACGCGGCGAGCCTG AGAGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCC 
GTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGG 

ProGlyAspProProGinProGluTyrAspLeuGluLeuUeThrSerCysSerSerAsn 
241 CCTGGGG ACCCCCC ACAACCAG AATACGACTTGG AGCTCATAACATCATGCTCCTCCAAC 
GGACCCCTGGGGGGTGTTGGTCTTATG^TGAACCTCGAGTATTGTAGTACGAGGAGGTTG 

ValSerValAlaHisAspGlyAlaGlyLysArgValTvrTyrLeuThrArgAspProThr 

3 0 1 gtgtcagtcgcccacgacggcgctggaaagagggtctactacctcacccgtgaccctaca 

cacagtcagcgggtgctgccgcg:.:jtttctcccagatgatggagtgggcactgggatgt 

ThrProLeuAlaArgAlaAlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeu 
361 ACCCCCCTCGCGAGAGCTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTA 
TGGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGAT 

GlyAsnllelleMetPheAlaProThrLeuTrpAla 

4 21 GGCAACATAATCATGTTTGCCCCCACACTGTGGGCG 

CCGTTGTATTAGTACAAACGGGGGTGTGACACCCGC 



FIG. 31 Translation of DNA 15e 



G 1 yAl aG lyLy sArgVa iTy rTy r LeuThr Ar g As p P r oThrThr P roLeuAl aAr gAl a 
1 CGGCGCTGGAAAGAGGGTCTACTACCTCACCCGTGACCCTACAACCCCCCTCGCGAGAGC 
GCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTTGGGGGGAGCGCTCTCG 

Overlap with 26g 

AlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeuGlyAsnllelleMetPhe 
61 TGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAGGCAACATAATCATGTT 
ACGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATCCGTTGTATTAGTACAA 



AlaProThrLeuTrpAlaArgMetlleLeuMetThrHisPhePheSerValLeuIleAla 
121 TGCCCCCACACTGTGGGCGAGG ATGATACTGATG ACCCATTTCTTTAGCGTCCTTATAGC 
ACGGGGGTGTGACACCCGCTCCTACTATGACTACTGGGTAAAGAAATCGCAGGAATATCG 

ArgAspGlnLeuGluGlnAlaLeuAspCysGluIleTyrGlyAlaCysTyrSerlleGlu 
1 S 1 CAGGG ACCAGCTTG AACAGGCCCTCG ATTGCG AG ATCT ACGGGGCCTGCTACTCCATAG A 
GTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCCGGACGATGAGGTATCT 

ProLeuAspLeuProProIlelleGlnArqLeu 
24 1 ACCACTTGATCTACCTCCAATCATTC;uV.GACTC 
TGGT^AACTAGATGGAGGTTAGTAAGTTTCTGAG 



EP 0 3ltt 216 Al 



FIG. 32-1 COMBINED ORF OF DNAiJ 12 f throuyh 15e 



IlePheLysIleArgMetTyrVdlGlyGlyValGluHisArgLeuGluAlaAlACysAsn 
1 CCATATTTAAAATCAGGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCA 
GGTATAAATTTTAGTCCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGT 

TrpThrArgGlyGluArg.CysAspLe-iGluAspArgAspArgSerGluLeuSerProLeu 
61 ACTGGACGCGGGGCGAACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGT 
' TGACCTGCGCCCCGCTTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCA 

LeuLeuThrThrThrGlnTrpGInValLeuProCysSerPheThrThrLeuProAlaLeu 
121 TACTGCTGACCACTACACAGTGGCAGGTCCTCCCGTGTTCCTTCACAaCCCTACCAGCCT 
ATGACGACTGGTGATGTGTCACCGTCCAGGAGGGCACAAGGAAGTGTTGGGATGGTCGGA 

SerThrGlyLeuIleHisLeuHisClnAsnlleValAspValGinTyrLeuTyrGlyVal 
1 8 1 TGTCCACCGGCCr<^ATCCACCTCCACCAGAACATTGloGACGTGCAGTACTTGTACGGGG 
ACAGGTGGCCGGAGTAGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCC 

GlySerSerlleAlaSerTrpAlalleLysTrpGluTyrValValLeuLeuPheLeuLeu 
241 TGGGGTCAAGCATCGCGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTC 
ACCCCAGTTCGTAGCGCAGGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAAG 

LeuAiaAspAlaArgValCysSerCysLeuTrpMetMetLeuLeuIleSerGlnAlaGlu 
301 TGCTTGCAGACGCGCGCGTCTGCTCCTGCTTGTGGATG ATGCTACTCATATCCCAAGCGG 
ACGAACGTCTGCGCGCGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCC 

AlaAlaLeuGluAsnLeuVallleLeuAsnAlaAlaSerLeuAlaGlyThrHisGlyLeu 
361 AGGCGGCTTTGGAG AACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTC 
TCCGCCGAAACCTCTTGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAG 

ValSerPheLeuValPhePheCysPheAlaTrpTyrLeuLysGlyLysTrpValProGly 
4 21 TTGTATCCTTCCTCGTGTTCTTCTGCTTTGCATGGTATTTGAAGGGTAAGTGGGTGCCCG 
AACATAGGAAGGAGCACAAGAAGACGAAACGTACCATAAACTTCCCATTCACCCACGGGC 

AlaValTyrThrPheTyrtlyMetTrpProLeuI^uLeuI^uLeuLeuAlaLeuProGln 
4 81 GAGCGGTCTACACCTTCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCC 
CTCGCCAGATGTGGAAGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGG 

ArgAlaXyrAlaLeuAspThrGluValAlaAlaSerCysGlyGlyValValLeuValGly 
541 AGCGGGCGTACGCGCTGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCG 
TCGCCCGCATGCGCGACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACAAGAGCAGC 

LeuMetAlaLeuThrLeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrp 
601 GGTTGATGGCGCTGACTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGT 
CCAACTACCGCGACTGAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCA 

LeuGlnTyrPheLeuThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsn 
661 GGCTTCAGTATTTTCTGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCA 
CCGAAGTCATAAAAGACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGT 

ValArgGlyGlyArgAspAlaVallleLeuLeuMetCysAlaValHisProThrLeuVal 
721 ACGTCCGAGGGGGGCGCGACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGG 
TGCAGGCTCCCCCCGCGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACC 

PheAspIleThrLysLeuLeuLeuAleValPheGlyProLeuTrpIleLeuGlnAlaSer 
781 TATTTG ACATCACCAAATTGCTGCTGGCCGTCTTCGGACCCCTTTCGATTCTTCAAGCOV 
ATAAACTGTAGTGGTTTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGT 

LeuLeuLysValProTyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAla 
841 GTTTGCTTAAAGTACCCTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAG 
CAAACGAATTTCATGGGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCA.MC 

ArgLysMetlleGlyGlyHisTyrValGlnMetValllelleLysLeuGlyAlaLeuThr 
901 CGCGGAAGATGATCGGAGGCCATTACGTGCi\AATGGTCATCATTAAGTTAGGGGCGCTTA 
GCGCCTTCTACTAGCCTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAAT 



GlyThrTyrValTyrAsnMisLftuThrProLeuArgAspTrpAldHisAsnGlyLeuArg 
961 CTGGCACCTATGTTTATAACCATC'i CACTCCTCTTCGGG ACTGGGCGCACAACGGCTTGC 
GACCGTGGATACAAATATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACG 

AspLeuAlaValAla'/alGluProValValPheSerGliiMetGluThrLysLeuIieThr 
1021 GAGATCTGGCCGTGGCTGTAGAGCCAGTCGTCTTCTCCC/\AATGGAGACCAAGCTCATCA 
CTCTAGACCGGCACCGACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGT 

TrpGlyAlaAspThrAlaAlaC/sGlyAspUelleAsnGlyLeuProValSerAlaArg 
1081 CGTGGGGGGCAG ATACCGCCGCGTGCGGTGACATCATCAACGGCTTGCCTGTTTCCGCCC 
GCACCCCCCGTCTATGGCGGCGCACGCCACTGTAGTAGTTGCCGAACGGACAAAGGCGGG 

ArgGlyArgGluIleLeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeu 
114 1 GC AGGGGCCGGGAG ATACTGCTCGGGCCAGCCGATGG AATGGTCTCCAAGGGGTGGAGGT 
CGTCCCCGGCCCTCTATGACGAGCCCGGTCGGCTACCTTACCAGAGGTTCCCCACCTCCA 

LeuAlaProIleThrAlaTyrAlaGlnGlnThrArgGlyLeuLeuGlyCysIlelleThr 
1201 TGCTGGCGC:CATCACGGCGTACGCCCAGCAGACAAGGGGCCTCCTAGGGTGCATAATCA 
ACGACCGCGGGTAGTGCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGT 

SerLeuThrGIyArgAspLysAsnGlnValGluGlyGluValGInlleValSerThrAla 
1261 CCAGCCTAACTGGCCGGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTG 
GGTCGGATTGACCGGCCCTGTTTTTGGTTCACCTCCCACTCCAGGTCIAACACAGTTGAC 

AlaGlnThrPheLeuAlaThrCysIleAsnGlyValCysTrpThrValTyrHisGlyAla 
1321 CTGCCCAAACCTTCCTGGCAACGTGCATCAATGGGGTGTGCTGGACTGTCTACCACGGGG 
GACGGGTTTGGAAGGACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCC 

GlyThrArgThrlleAlaSerProLysGlyProVallleGlnMetTyrThrAsnValAsp 
1381 CCGGAACGAGGACCATCGCGTCACCCAAGGGTCCTGTCATCCAGATGTATACCAATGTAG 
GGCCTTGCTCCTGGTAGCGCAGTGGGTTCCCAGGACAGTAGGTCTACATATGGTTACATC ' 

GInAspLeuValGlyTrpProAlaProGlnGlySerArgSerLeuThrProCysThrCys 
14 41 ACCAAGACCTTGTGGGCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTT 
TGGTTCTGGAACACCCGACCGGGCGAGGCGTTCCATCGGCGAGTAACTGTGGGACGTGAA 

GlySerSerAspLeuTyrLeuValThrArgHisAlaAspVallleProValArgArgArg 
1501 GCGGCTCCTCGG ACCTTTACCTGGTC ACGAGGCACGCCG ATGTCATTCCCGTGCGCCGGC 
CGCCGAGGAGCCTGGAAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCG 

GlyAspSerArgGlySerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSer 
1561 GGGGTG ATAGCAGGGGCAGCCTGCTGTCGCCCCGGCCC ATTTCCT;.CTTG AAAGGCTCCT 
CCCCACTATCGTCCCCGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGA 

GlyGlyProLeuLeuCysProAlaGlyHisAlaValGlyllePheArgAlaAlaValCys 
1621 CGGGGGGTCCGCTGTTCTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGT 
GCCCCCCAGGCGACAACACGGGGCGCCCCGTGCGGCACCCGTATAAATCCCGGCGCCACA 

ThrArgGlyValAlaLysAlaValAspPKelleProValGluAsnLeuGluThrThrMet 
1681 GCACCCGTGGAGTGGCTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCA 
CGTGGGCACCTCACCGATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGT 

ArgSerProValPheThrAspAsnSerSerProProValValProGlnSerPheGlnVal 
1741 TGAGGTCCCCGGTGTTCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGG 
ACTCCAGGGGCCACAAGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCC 

AlaHisLeuHisAlaProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAla 
1801 TGGCTCACCTCCATGCTCCCAC AGGCAGCGGCAAAAGC ACCAAGGTCCCGGCTGCATATG 
ACCGAGTGGAGGTACGAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATAC 

AlaGlnGlyTyrLysValLeuValLeuAsnProSerValAlaAlaThrLeuGlyPheGly 
1861 CAGCTCAGGGCTATAAGGTGCTAGTACTCAACCCCTCTGTTGCTGCAACACTGGGCTTTG 
GTCGAGTCCCGATATTCCACGATCATGAGTTGGGGAGACAACGACGTTGTGACCCGAAAC 

AlaTyrMetSerLysAlaHisGlylleAspProAsnlle.-aLrgThrGlyValArgThrlle 

FIG. 32-2 



1921 GTGCTTACATGTCCAAGGCTCATGGG ATCG ATCC7AACATCAGGACCGGGGTG AGAAO^ J> 
CACGAATGTACAGGTTCCGAG'ACCCTAGCTAGGAVTGTAGTCCTGGCCCCACTCTTGTT 

ThrThrGlySerProIleThrTyrSerThrTyrGl'/LysPheLeuAlaAspGlyGlyCvs 
1981 TTACCACTGGCAGCCCCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGACGGCGGGT 
AAl-GGTGACCGTCGGGGTAGTGCATGAGGTGGATCCCGTTCAAGGAACGGCTGCCGCCCA 

, -SerGlyGlyAlaTyrAspllellelleCysAspGluCysHisSerThrAspAlaThrSer 
2 0 4 1_ GCTCGGGGGGCGCTTATGACATAATAATTTG TG ACGAGTGCCACTCCACGGATGCCACAT 
CGAGCCCCCCGCGAATACTGTATTATTAAA'JACTGCTCACGGTGAGGTGCCTACGGTGTA 

Ai ^^?i^i;^5iX^^^^^y^*^^''^^^'^sP^l"AlaGl"ThrAlaGlyAlaArgLeuValVal 
2101 CCATCTTGGGCATCGGCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTG 
GGTAGAACCCGTAGCCGTGACAGGAACTGGTTCGTCTCTG/.CGCCCCCGCTCTGACCAAC 

. , _^«"AlaThrAlaThrProProGlySerValThrValProHisProAsnIleGluGluVal 
2161 TGCTCGCCACCGCCACCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGG 
ACGAGCGGTGGCGGTGGGGAGGCCCGAGGCGTGACACGGGGTAGGGTTGTAGCTCCTCC 

AlaLeuSerThrThrGlyGluIleProPheTyrGlyLysAlalleProLeuGluVallle 
2221 TTGCTCTGTCCACCACCGGAGAGATCCCTTTTTACGGCAAGGCTATCCCCCTCG AAGTAA 
AACGAGACAGGTGGTGGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATT 

^ysGlyGlyArgHisLeullePheCysHisSerLysLysLysCvsAspGluLeuAlaAla 
2231 TCAAGGGGGGGAGACATCTCATCTTCTGTCATTCAAAGAAG AAGTGCGACGAACTCGCCG 
AGTTCCCCCCCTCTGTAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGC 

■ LysLeuValAlaLeuGlylleAsnAlaValAlaTyrTyrArgGlyLeuAspValSerVal 
2341 CAAAGCTGGTCGCATTGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTCTCCG 
GTTTCGACCAGCGTAACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGC 

IleProThrSerGlvA.«pValValValValAiaThrAspAlaLeuMetThrGlyTyrThr 
24 01 TCATCCCGACC?>.CCGGCGA'1GTTGTCGTCGTGGCAACCGATGCCCTCATGACCGGCTATA 
AGTAGGGCTGGTCGCCGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATAT 

r 1 ^^5^y^=PP^^spSerValIleAspCysAsnThrCysValThrGlnThrValAspPheSer 
24 61 CCGGCGACTTCGACTCGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCG ATTTCA 
GGCCGCTGAAGCTGAGCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGT 

-,c-,, ^ LeuAspProThrPheThrlleGluThrlleThrLeuProGlnAspAlaValSerArgThr 
2521 GCCTTGACCCTACCTTCACCATTG AGACAATCACGCTCCCCCAGG ATGCTGTCTCCCGCA 
CGGAACTGGGATGGAAGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGT 

->«; a 1 J?^9AxgGlyArgThx<31yArgGlyLysProGly IleTyrArgPheValAlaProGly 
2581 CTCAACGTCGGGGCAGGACTGGCAGGGGGAAGCCAGGCATCTACAGATTTCTGGCACCGG 
GAGTTGCAGCCCCGTCCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTCGCC 

^ GluArgProSerGlyMetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCys 
2641 GGGAGCGCCCCTCCGGCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTATG ACGCAGGCT 
CCCTCGCGGGGAGGCCGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGA 

AlaTrpTyrGluLeuThrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThr 
2701 GTGCTTGGTATGAGCTCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATCAACA 
CACGAACCATACTCGAGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGT 

„ . , ^^Pi^oGlyLeuProValCysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeu 
2/61 CCCCGGGGCTTCCCGTGTGCCAGGACCATCTTGAATTTTGGGAGGGCGTCTTTACAGGCC 
GGGGCCCCGAAGGGCACACGGTCCTGGTAGAACTTAAAACCCTCCCGCAGAAATGTCCGG 

ThrHisIleAspAlaHisPheLeuSerGlnThrLyGGlnSerGlyGluAsnLeuProTyr 
2821 TCACTCATATAGATGCCCACTTTCTATCCCAG ACAAAGCAG AGTGGGGAGAACCTTCCTT 
AGTGAGTATATCTACGGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAA 

LeuValAlaTyrGlnAlaThrValCysAl.aArgAlaGlnAlaProProProSerTrpAsp 
2881 ACCTGGTAGCGTACCAAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTCGG 
TGGACCATCGCATGGTTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCC 
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GlnMetTrpLysCysLeuIleArgLriuLysProThrLeuHisGlyPrcThrProLeuLeu 
294 1 ACCAG ATGTGG AAGTGTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGC 
TGGTCTACACCTTCACAjtW\CTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACG 

TyrArgLeuGlyAlaValGlnAsnGluIleThrLeuThrHisProValThrLysTyrlle 
3001 TATACAGACTGGGCGCTGTTCAGAATGAAATCACCCTGACGCACCCAGTCACCAAATACA 
ATATGTCTGACCCGCGACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGT 

MetThrCysMetSerAlr.AspLeuGluValValThrSerThrTrpValLeuValGlyGly 
3061 TCATGACATGCATGTCGGCCGACCTGGAGGTCGTCACGAGCACCTGGGTGCTCGTTGGCG 
AGTACTGTACGTACAGCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGC 

ValLeuAlaAlaLeuAlaAlaTyrCysLeuSerThrGlyCysVa IVallleValGlyArg 
3121 GCGTCCTGGCTGCTTTGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCA 
CGCAGGACCGACGAAACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGT 

ValValLeuSerGlyLysProAlallelleProAspArgGluValLeuTyrArgGluPhe 
3181 GGGTT'CTCTTGTCCGGGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCGAGAGT 
CCCAGCAGAACAGGCCCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCA 

AspGluMetGluGluCysSerClnHisLeuProTyrlleGluGInGlyMetMetLeuAla 
3241 TCG ATG AG ATGG AAG AGTGCTCTCAGCACTTACCGTACATCG AGC AAGGG ATG ATGCTCG 
AGCTACTCTACCTTCTCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGC 

GluGlnPheLysGlnLysAlaLeuGlyLeoLeuGlnThrAlaSerArgGlnAlaGluVal 
3 301 CCG AGCAGTTCAA.GCAGAAGGCCCTCGGCCTCCTGCAG ACCGCGTCCCGTCAGGCAG AGG 
GGCTCGTCAAGTTCGTCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCC 

IleAlaProAlaValGlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMet 
3 361 TTATCGCCCCTGCTGTCCAGACCAACTGGCAAAAACTCGAG ACCTTCTGGGCGAAGCATA 
AATAGCGGGGACGACAGGTCTGGTTGACCGTTTTTGAGCTCTGGAAGACCCGCTTCGTAT 

TrpAsnPhelleSerGlylleGlnTyrLeuAlaGlyLeuSerThrLeuProGlyAsnPro 
34 21 TGtGGAACTTCATCAGTGGG ATACAATACTTGGCGGGCTTGTCAACGCTGCCTGGTAACC 
ACACCTTGAAGTAGTCACCCTATGVTATGAACCGCCCGAACAGTTGCGACGGACCATTGG 

AlalleAlaSerLeuMetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGln 
34 81 CCGCCATTGCTTCATTGATGGCTTTTACAGCTGCTGTCACCAGCCC ACTAACCACTAGCC 
GGCGGTAACGAAGTAACTACCGAAAATGTCGACGACAGTGGTCGGGTGATTGGTGATCGG 

ThrLeuLeuPheAsnlleLeuGlyGlyTrpValAlaAlaGlnLeuAlaiAlaProGlyAla 
3541 AAACCCTCCTCTTCAACATATTGGGGGGGTGGGTGGCTGCCCAGCTCGCCGCCCCCGGTG 
TTTGGGAGGAGAAGTTGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCAC 

AlaThrAlaPheValGlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGly 
3601 CCGCTACTGCCTTTGTGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGG ACTGG 
GGCGATGACGGAAACACCCGCGACCGAATCGACCGCGGCGGTAGCCGTCACAACCTGACC 

LysValLeuIleAspIleLeuAlaGlyTyrClyAlaGlyValAlaGlyAlaLeuValAla 
3661 GGAAGGTCCTCATAGACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGG 
CCTTCCAGGAGTATCTGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACC 

PheLysIleMfitSerGlyGluValProSerThrGluAspLeuValAsnLeuLeuProAla 
3721 CATTCAAG ATCATG AGCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCG 
GTAAGTTCTAGTACTCGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGC 

IleLeuSerProGlyAlaLeuValValGlyValValCysAlaAlalleLeuArgArgHis 
3781 CCATCCTCTCGCCCGGAGCCCTCGTAGTCGGCGTGGTCTGTGCAGC AATACTGCGCCGGC 
GGITAGGAGAGCGGGCCTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCG 

ValGlyProGlyGluGlyAlaValGlnTrpMetAsnArgLeuIleAlaPheAlaSerArg 
3841 ACGTTGGCCCGGGCG AGGGGGC AGTGCAGTGG ATG AACCGGCTG AT AGCCTTCGCCTCCC 
TGCAACCGGGCCCGCTCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGG 

GlyAsnHisValSerProThrHisTyrVal?rcCluSerAspAlaAlaAla;i.rgValThr 
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Hoi GGGGGAACCATGTTTCCCCCACrCACTACGTGCCGG;xGAGCGATGCAGCTGCCCCCGTCA 
CCCCCTTGGTACAAAGGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGT 

AlalleLeuSerSerLeuThrValThrGlnLeuLeuArgAroLeuHisGlnTrpIleSer 
3961 CTGCCATAC7CAGC AGCCTCACTGTAACCC AGCTCCTG AGGCG ACTGCACC AGTGG ATAA 
GACGGTATGAGTCGTCGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATT 

SerGluCysThrThrProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCys 
4 0 2 1 - GCTCGG AGTGTACCACTCCATGCTCCGGTTCCTGGCTAAGGG ACATCTGGG ACTGGATAT 
CGAGCCTCACATGGTGAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATA 

GluValLeuSerAspPheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGly 
4 081 GCGAGGTGTTGAGCGACTTTAAGACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTG 
CGCTCCACAACTCGCTGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGAC 

IleProPheValSerCysGlnArgGlyTyrLysGlyValTrpArgValAspGlylleMet 
4141 GG ATCCCCTTTGTGTCCTGCCAGCGCGGGTA'^AAGGGGGTCTGGCG AGTGGACGGCATCA 
CCTAGGbCAAACACAGGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGT 

HisThrArgCysHisCysGlyAlaGluIleThrtlyHisValLysAsnGlyThrMetArg 
4 201 TGCACACTCGCTGCC ACTGTGG AGCTG AG ATCACTGG AC ATGTCAAAAACGGG ACG ATG A 
ACGTGTGAGCGACGGTGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACT 

IleValGlyProArgThrCysArgAsnMetTrpSerGlyThrPheProIleAsnAlaTyr 
4 261 GG ATCGTCGGTCCTAGGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCT 
CCTAGCAGCCAGGATCCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGA 

ThrThrGlyProCysThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgVal 
4 321 ACACCACGGGCCCCTGTACCCCCCTTCCTGCGCCG AACTACACGTTCGCGCTATGGAGGG 
TGTGGTGCCCGGGGACATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCC 

SerAlaGluGluTyrVdiuxuIleArgGlnValGlyAspPheHisTyrValThrGlyMet 
4 381 TGTCTGCAGAGGAATATGTGGAGATAAGGCAGGTGGGGGACTTCCACTACGTGACGGGTA 
ACAGACGTCTCCTTATACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCAT 

ThrThrAspAsnLeuLysCysProCysGlnValProSerProGluPhePheThrGluLeu 
4 4 41 TGACTACTGACAATCTCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAG AAT 
ACTGATGACTGTTAGAGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTA 

AspGlyValArgLeuHisArgPheAlaProProCysLysProLeuLeuArgGluGluVal 
4 501 TGGACGGGGTGCGCCTACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGG AGGAGG 
ACCTGCCCCACGCGGATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCC 

SerPheArgValGlyLeuHisGluTyrProValGlySerGlnLeuProCysGluProGlu 
4561 TATCATTCAGAGTAGGACTCCACG AATACCCGGTAGGGTCGCAATT ACCTTGCG AGCCCG 
ATAGTAAGTCTCATCCTGAGGTGCTTATGGGCCATCCCAGCGTTAATGGAACGCTCGGGC 

ProAspValAlaValLeuThrSerMetLeuThrAspProSerHisIleThrAlaGluAla 
4621 AACCGGACGTGGCCGTGTTGACGTCCATGCTCACTGATCCCTCCCATATAACAGCAGAGG 
TTGGCCTGCACCGGCACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCC 

AlaGlyArgArgLeuAlaArgGlySerProProSerValAlaSerSerSerAlaSerGln 
4681 CGGCCGGGCGAAGGTTGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCC 
GCCGGCCCGCTTCCAACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGG 

LeuSerAlaProSerLeuLysAlaThrCysThrAlaAsnHisAspSerProAspAlaGlu 
4741 AGCTATCCGCTCCATCTCTCAAGGCAACTTGCACCGCTAACCATG ACTCCCCTG ATGCTG 
TCGATAGGCGAGGTAGAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGAC 

LeuIleGluAlaAsnLeuLeuTrpArgGlnGluMetGlyGlyAsnlleThrArgValGlu 
4 801 AGCTCATAG AGGCC AACCTCCTATGG AGGCAGG AG ATGGGCGGCAACATCACCAGGGTTG 
TCGAGTATCTCCGGTTGGAGGATACCTCCGTCCTCTACCCGCCGTTGTAGTGGTCCCAAC 

SerGluAsnLysValVallleLeuAspSerPheAspProLeuValAlaGluGluAspGlu 
4 861 AGTCAGAAAACAAAGTGGTGATTCTGGACTCC7TCGATCCGCTTGTGGCGGAGG AGGAGG 
TCAGTCTTTTGTTTCACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGC 
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•'^"^ ^;GcgggagatctccgtacccgcagaaaxctgcggaIgtctcggagattcgc?caSg^^^ 

TCGCCCTCTAGAGGCATGGGCGTCTTTAGGACGCCTTCACAGCCTCTAAGCGGGTCCGGG 

, o„ , ^^^°Yf^il5PAlaArgProAspTyrAsnProProLeuValGluThrTrpLysL/sProAsp 
4 981 TGCCCGTTTGGGCGCGGCCGGACTATAACCCCCCGCTAGTGGAGACGTGGAAAAAGCCCG 
ACGGGCAAACCCGCGCCGGCCTGATATTGGGGGGCGATCACCTCTGCACCTT-TTCGGGC 

c,>. , .^I^£9^"P^"°P^oValValHisGlyC7sProLeuProProProLysSerProProValPro 
504 1 ACTACGAACCACCTGTGGTCCATGGCTGTCCGCTTCCACCTCCAAAGTCCCCTCCTGTGC 
• TGATGCTTGGTGGACACCAGGTACCGACAGGCGAAGGTGGAGGTTTCAGGGGAGGACACG 

c , „ , ^ P^°P^°ArgLysLysArgThrValValLeuThrGluSerThrLeMSerThrAlaLeuAla 
5101 CTCCGCCTCGGAAGAAGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGG 
GAGGCGGAGCCTTCTTCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACC 

=;i<;i „JJ"^«)J^ifT!}£^'^'3SerPheGlySerSerSerThrSerGlyIleThrGlyAspAsnThr 
= ^ ^ ^ SS^tS.^^^^^^' -CAGAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCG ACAATA 
GGCTCGAGCGGTGGTCTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTAT 

^'^^^ThrSerSerGluProAlaProSerGlyCysProProAspSerAspAlaGluSerTyr 
" ^ ^ S^ACAACATCCTCTGAGCCCGCGCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCT 

GCTGTTGTAGGAGACTCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGA " 

= , ^^^erSerfletProProLeuGluGlyGluProGlyAspProAspLeuSerAspGlySerTrp 
5281 ATTCCTCCATGCCCCCCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCAT 
TAAGGAGGTACGGGGGGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTA 

- -, , , ^^^^f^^^v^^SerSerGIuAlaAsnAlaGluAspValValCysCysSer.MetSerTyrSer 
3 3 4 1 GGTCAACGGTCAGTAGTGAGGCCAACGCGGAGGATGTCGTGTGCTGCTCAATGTCTTACT 
CCAGTTGCCAGTCATCACTCCGGTTGCGCCTCCTACAGCACACGACGAGTTACAGAATGA 

- , „, ^^P'^^^^lyAlaLeuValThrProCysAlaAlaGluGluGlnLysLeuProIleAsnAla 
3- 0 X CTTGGACAGGCGCACTCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATG 
GAACCTGTCCGCGTGAGCAGTGGGGCACGCGGCGCCTTCTTGTCTTTGACGGGTAGTTAC 

c 1 "AsnSerLeuLeuArgHisHisAsnLeuValTyrSerTlirThrSerArgSerAla 
^ t ^GCAACTCGTTGCTACGTCACCACAATTTGGTGTATTCCACCACCTCACGCAGTG 
GTGATTCGTTGAGCAACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCAC 

c c , ^^"ArgGlnLysLysValThrPheAspArgLeuGlnValLeuAspSerHisTyrGln 
5521 CTTGCCAAAGGCAG AAGAAAGTCACATTTGACAGACTGCAAGTTCTGG ACAGCCATTACC 
GAACGGTTTCCGTCTTCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATGG 

rrn. ^^P^^lLeuLysGluValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerVal 
5581 AGGACGTACTCAAGGAGGTTAAAGCAGCGGCGTCAAAAGTGAAGGCTAACTTGCTATCCG 
TCCTGCATGAGTTCCTCCAATTTCGTCGCCGCAGTTTTCACTTCCGATTGAACGATAGGC 

GluGluAlaCysSerLeuThrProProHisSerAlaLynSerLysPheGlyTyrGlyAla 
5641 TAGAGGAAGCTTGCAGCCTGACGCCCCCACACTCAGCCAAATCC AAGTTTGGTTATGGGG 
ATCTCCTTCGAACGTCGGACTGCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCC 

r-,«, „ ^ysAspValArgCysHisAlaAxgLysAlaValThrHisIleAsnSerValTrpLysAsp 
5/01 CAAAAGACGTCCGTTGCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAG 
GTTTTCTGCAGGCAACGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTC 

c^^, ^E:®"L«"GluAspAsnValThrProIleAspThrThrHeMetAlaLysAsnGluVal?he 
5761 ACCTTCTGGAAGACAATGTAACACCAATAGACACTACCATCATGGCTAAGAACG AGGTTT 
TGGAAGACCTTCTGTTACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAA 

r„,, CysValGlnProGluLysGlyGlyArgLysProAlaArgLeuIleValPheProAspLeu 
- 8 2 1 TCTGCGTTCAGCCTGAGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATC 
AGACGCAAGTCGGACTCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAG 

GlyValArgValCysGluLysMetAlaLeuTyrAsDValValThrLvsr.euProLeuA.la 
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5881 TGGGCGTGCGCGTGTGCGAAA/\GATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGG 
ACCCGCACGCGCACACGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACC 

ValMetGlvSerSerTyrGlyPheGlnTyrSerProGlyGlrLArgValGluPheLeuVai 
5941 CCGTGATGGG AAGCTCCTACGGATTCCAATACtCACCAGGACAGCGGGTTG AATTCCTCG 
GGCACTACCCTTCGAGGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGC 

GlnAlaTrpLysSerLysLysThrProMetGlyPheSerTyrAspThrArgCysPheAsp 
6001 TGCAAGCGTGGAAGTCCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTG 
ACGTTCGCACCTTCAGGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAAC 

SerThrValThrGluSerAspIleArgThrGluGluAlalleTyrGinCysCysAspLeu 
6061 ACTCCACAGTCACTGAGAGCGACATCCGTACGGAGGAGGCAATCTACCAATGTTGTGACC 
TGAGGTGTCAGTGACTCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTTACAACACTGG 

AspProGlnAlaArgValAlalleLysSerLeuThrGluArgLeuTyrValGlyGlyPro 
6121 TCG ACCCCCAAGCCCGCGTGGCCATCAAGTCCCTCACCGAGAGGCTTTATGTTGGGGGCC 
AGCTGGGGGTTCGGGCGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGG 

LeuThrAsnSerArgGlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeu 
6181 CTCTTACCAATTCAAGGGGGGAGAACTGCGGCTATCGC AGGTGCCGCGCGAGCGGCGTAC 
GAGAATGGTTAAGTTCCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGCTCGCCGCATG 

ThrThrSerCysGlyAsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysAi lAla 
624 1 TGACAACTAGCTGTGGTAACACCCTCACTTGCTACATCAAGGCCCGGGCAGCCTGTCGAG ^ 
ACTGTTGATCGACACCATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTC 

AlaGlyLeuGlnAspCysThrMetLeuValCysGlyAspAspLeuValVallleCysGIu 
6 301 CCGCAGGGCTCCAGGACTGCAC^J^ TGCTCGTGTGTGGCGACGACTTAGTCGTTATCTGTG 
GGCGTCCCGAGGTCCTGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACAC 

SerAlaGlyValGlnGluA.spAlaAlaSerLeuArgAlaPheThrGluAlaMetThrArg 

63 61 AAAGCGCGGGGGTCCAGGAGGACGCGGCGAGCCTGAGAGCCTTCACGGAGGCTATGACCA 

TTTCGCGCCCCCAGGTCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGT 

TyrSerAlaProProGlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSer 

64 21 GGTACTCCGCCCCCCCTGGGGACCCCCCACAACCAGAATACGACTTGGAGCTCATAACAT 

CCATGAGGCGGGGGGGACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTA 

CysSerSerAsnValSerValAlaHisAspGlyAlaGlyLysArgValTyrTyrLeuThr 
64 81 CATGCTCCTCCAACGTGTCAGTCGCCCACGACGGCGCTGGAAAG AGGGTCTACTACCTCA 
GTACGAGGAGGTTGCACAGTCAGCGGGTGCTGCCGCGACCTTTCTCnCAGATGATGGAGT 

ArgAspProThrThrProLeuAlaArgAiaAlaTrpGluThrAlaArgHisThrProVal 
6541 CCCGTG ACCCTACAACCCCCCTCGCGAGAGCTGCGTGGG AG ACAGCAAGACACACTCCAG 
GGGCACTGGGATGTTGGGGGGAGCGCTCTCGACGCACCCTCTGTCGTTCTGTGTGAGGTC 

AsnSerTrpLeuGlyAsnllelleMetPheAlaPfoThrLeuTrpAlaArgMetlleLeu 
6601 TCAATTCCTGGCTAGGCAACATAATCATGTTTGCCCCCACACTGTGGGCGAGG ATGATAC 
AGTTAAGGACCGATCCGTTGTATTAGTACAAACGGGGGTGTGACACCCGCTCCTACTATG 

MetThrHisPhePheSerValLeuIleAlaArgAspGlnLeuGluGlnAlaLeuAspCys 
6661 TGATGACCCATTTCTTTAGCGTCCTTATAGCCAGGGACCAGCTTGAACAGGCCCTCGATT 
ACTACTGGGTAAAGAAATCGCAGGAATATCGGTCCCTGGTCGAACTTGTCCGGGAGCTAA 

GluIleTyrGlyAlaCysTyrSerlleGluProLeuAspLeuProProllelleGlnArg 
6721 GCGAGATCTACGGGGCCTGCTACTCCATAG AACCACTTGATCTACCTCCAATCATTCAAA 
CGCTCTAGATGCCCCGGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTT 
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Lane 
Number 



Ch imp 
Ro terance 
Number 



Infection 
Type 



Sample 
dace (days) 
( 0-inoculdtion 
day) 



XLT (alanine) 
aminocransferaje 
level in sera) m></ml 



3 

4 

6 
7 
9 

9 
I 0 
1 I 

I r 

I 3 

1 4 

1 5 

1 6 

: 7 

13 
19 
20 

21 

2 2 
2 3 
2i 

26 
2? 
28 
29 

30 
31 
32 
33 

34 

35 
36 



NAN8 
NANB 

NANB 
NANB 

HANB 

NANB 
NANB 
NANO 
NANB 

NANB 
NANB 
NANB 
NANB 

KAV 
KAV 
HAV 
HAV 

HAV 
HAV 
HAV 
HAV 

HAV 
HAV 
HAV 
HAV 

HAV 
HAV 
HAV 
HAV 

HBV 
HBV 
HBV 



0 
7b 
119 
154 

0 
21 
73 
130 

0 

4 ) 

53 
15^ 

•3 

5 5 
83 

140 

0 
25 
40 
268 

-8 
15 
4 I 
129 

0 
22 
115 
139 

0 
26 

74 

205 

■290 
379 
435 



0 
71 
19 
N/A 

5 
52 
13 
N/A 

8 

205 
I 4 

6 

1 I 
132 
N/A 
N/A * 

4 

147 
l8 
5 

N/A 

tot; 

10 
N/A 

7 

83 
5 

N/A 

15 
I )0 
8 
5 

M/A 
9 
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37 
38 
39 

40 

4 1 
42 
4 3 
4 4 
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10 
10 
10 

1 1 
1 1 
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HBV 
HBV 
HBV 
HBV 

HBV 
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HBV 
HBV 



0 

111-118 
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240 
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223 
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13 
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9 
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I 
2 
3 
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5 
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8 
9 
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12 
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15 
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20 
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ll 
2l 
2l 
2l 
3l 
3l 

Tl 

4 I 
4l 
5l 
5l 
6l 
61 
7l 
7l 

81 
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Diagnosis 



AJ.T Level (mu/ml) 



NANB 

NA^B 

NANS 

NANS 

NANS 

NAMB 

NAI4B 

NAHB 

NANB 

NAN9 

NANB 

NANB 

NANB 

NANB 

NANB 

NANB 

NANB 

NANB 

NANB 

NANB 

NANB 



1354 
31 
14 
79 
26 
78 
87 
25 
60 
13 
298 
101 
4 74 
318 
20 
163 
44 
50 
N/A 
N/A* 
N/A 



22 
23 
24 



12 
13 
14 



Normal 
Normal 
Normal 



N/A 
N/A 
N/A 



26 


30174 


Normal 


N/A 


27 


30105 


Normal 


N/A 


28 


30072 


Normal 


N/A 


29 


30026 


Normal 


N/A 


30 


30146 


Normal 


N/A 


3 I 


30250 


Normal 


N/A 


32 


3007 1 


Normal 


N/A 


33 


15 


AcuteHAV 


N/A 


34 


16 


AcuceHAV 


N/A 


35 


17 


AcuteHAV 


N/A 


36 


18 


AcuteHAV 


N/A 


37 


48088 


AcuteHAV 


N/A 


38 


47288 


AcuteHAV 


N/A 


39 


47050 


AcuteHAV 


N/A 


40 


46997 


AcuteHAV 


N/A 


4 I 


19 


Convalescent HBV 


N/A 


4 2 


20 


( ant i-HBSag^ve; 


N/A 


43 


21 


anti-HBCag*ve) 


N/A 


4 4 


22 


( anti-HBSag-t^ve ; 


N/A 


45 


23 


anti-HBCag+ve) 


N/A 


46 


24 


( anti-HBSag+ve; 


N/A 


47 


25 


anti-HBCag+ve ) 


N/A 


49 


26 


{ anti-HBSag+ve; 


N/A 


4S 


27 


anci-HBSag't-ve) 


N/A 



Sequentidl serum samples were assayed from these patients 
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Clal« 1 

Hlad3 7^ 

sti 212 



SphI 12350 

Ncoi iiaos 



Xbal 
10727 

Hind3 
10413 



Hind3 \ 
9099 -A 

Ecorl 
9023 

Hcol 8484 

Ecorl 8260 

Xpal 7870 
Clal 7774 
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FIG. 36-1 COOH-terminus of SOD-ClOO Fusion Polypeptide 

SOD^ COOri] (—adaptor ] (NANBHpolypeptide> 

AlaCysGlyVallleGlylleAlaGlnAsnLeuGlylleAxgAspAlaHisPheLeuSer 
1 GCTTGTGGTGTAATTGGGATCGCCCAGAATTTGGGAATTCGGGATGCCCJ^CTTTCTATCC 
CGAACACCACATTAACCCTAGCGGGTCTTAAACCCTTAAGCCCTACGGGTGAAAGATAGG 

-->>>>>>>>>>>>>>>>>>>>> 
GlnThrLysGlnSerGlyGluAsnl^uProTyrLeuValAlaTyrGlnAlaThrValCys 
61 CAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACCAAGCCACCGTGTGC 
GTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCATCGCATGGTTCGGTGGCACACG 

AlaArgAlaGlriAlaProProProSexTrpAzipGlnMetTrpLysCysLeuIleArgLeu 
121 GCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAG ATGTGGAAGTGTTTG ATTCGCCTC 
CGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCACAAACTAAGCGGAG 

LysProThrLeuHlsGlyProThrProLeuLeuTyrArgLeuGlyAlaValGlnAsnGlu 
181 AAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAG ACTGGGCGCTGTTCAG A^ 

TTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGCGACAAGTCTTACT^ 

IleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSerAlaAspLeuGlu 
241 ATCACCCTGACGCACCCAGTCACCAAATACATCATGACATGCATGTCGGCCGACCTGG A^ 
TAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACAGCCGGCTGGACCTC 

ValValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeuAlaAlaTyrCys 
301 GTCGTCACGAGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTTTGGCCGCGTATTGC 
CAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAAACCGGCGCATAACG 

LeuSerThrt^lyCyr^^alVa IleValGlyArgValValLeuSerGlyLysProAlalle 
361 CTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCGGGAAGCCGGCA^ 

GACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGCCCTTCGGCCGTTAG 

IleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGluCysSerGlnHis 
4 21 ATACCTGACAGGGAAGTCCTCTACCGAGAGTTCGATGAGATGGAAGAGTGCTCTCAGCAC 
TATGGACTGTCCCTTCAGGAGATCGCTCTCAAGCTACTCTACCTT 

LeuProTTTlleGluGlnGlyMetMetLeuAlaGluGlnPheLysGlnLysAlaLeuGly 
481 TTACCGTACATCGAGCAAGGGATGATGCTCGCCGAGCAGTTCAAGCAGAAGGCCCTCG^ 
AATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCGTCTTCCGGGAGC^ 

LeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaValGlnThrAsnTrp 
541 CTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTGTCC^^ 

GAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGACAGGTCTGGT^ 

GlnLysI^uGluThrPheTrpAlaLysHisMetTrpAsnPhelleSerGlylleGlnTyr 
601 CAAAAACTCGAGACCTTCTGGGCGAAGCATATGTGG AACTTCATCAG 
GTTTTTGAGCTCTGGAAGACCCGCTTCGTATACACCT^ 

I^iiAlaGlyLeuSerThrLeuProGlyAsnProAlalleAlaSerLeiiMetAlaPheThr 
661 TTGGCGGGCTTGTCAACGCTGCCTGGTAACCCCGCCATTGCTTGATTGAO^ 

AACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTAACTACCGAAAATGT 

AlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsnlleLeuGlyGly 
721 GCTGCTGTCACCAGCCCACTAACCACTAGCCAAACCCTCCTCTTCAACATAl^ 
CGACGACAGTGGTCGGGTCATTGGTGATCGGTTTGGGAGGAGAAGTTC 

TrpValAlaAlaGlnl^uAlaAlaProGlyAlaAlaThrAlaPheValGlyAlaGlyLeu 
781 TGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTGTGGGCGCTGGCTTA 
ACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAACACCCGCGACCGAAT 

AlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAspIleLeuAlaGly 
341 GCTGGCGCCGCCATCGGCAGTGTTGGACTGGGG AAGGTCCTCATAG ACATCCTTGCAGGG 



CGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATCTGTAGGAACGTCCC 

TyrGly AlaGly .*d lAlaGlyAlaLeuValAiaPheLys IleMetSerGlyGluValPro 
901 TATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAG ATCATG AGCGGTGAGGTCCCC 
ATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACTCGCCACTCCAGGGG 

SerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGlyAlaLeuValVal 
961 TCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCGGAGCCCTCGTAGTC 
AGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGCCTCGGGAGCATCAG 

GlyValValCysAlaAlalleLeuArgArgHisValGlyProGii-GluGlyAlaValGln 
1021 GGCGTGGTCTGTGCAGCAATACTGCGCCGGOVCGTTGGCCCGGGCG AGGGGGCAGTGCAG 
CCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGCTCCCCCGTOVCGTC 

<<<<<<<<<<<<<<<<<<<<NANBH] [ extra 

TrpMetAsnArgLeuIleA \aPhe' laSerArgGlyAsnHisValSerProValHisHis 
1081 TGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTTCCCCAGTCCATCAT 
ACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAAGGGGTCAGGTAGTA 

] 

LysArgOP 

1141 AAGCGTTGACGCTCCCTACGGGTGGACTGTGGAGAGACAGGGCACTGCTAAGGCCCAAAT 
TTCGCAACTGCGAGGGATGCCCACCTGACACCTCTCTGTCCCGTGACGATTCCGGGTTTA 

1201 CTCAGCCATGCATCGAGGGGTACAATCCGTATGGCCAACAACTAGCGCGTACGTAAAGTC 
GAGTCGGTACGTAGCTCCCCATGTTAGGCATACCGGTTGTTGATCGCGCATGCArj.rrCAG 

1261 TCCTTTCTCGATGGTCCATACCTTAGATGCGTTAGCATTAATCCGAATTC 
AGGAAAGAGCTACCAGGTATGGAATCTACGCAATCGTAATTAGGCTTAAG 
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FIG. 41-1 

Homology between the HCV polypeptide encoded by combined ORF of clones 
14i through 39c) and the non-structural protein of the Dengue flavL- 
virus{MNWVDl) . 
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_ 10 20 30 40 50 

E YWLLFLLLADAKVCS CLWMMLL I SQAEAALENLV I LNAAS LAG THG LVS FLVFFC FA 

AVSFVTLITGNMSFRDIX3RVMVMVGATMTDDIGMGVTYIAIJJU\FK\^ 

130 140 150 160 170 180 

60 70 80 90 100 110 

W^-LKGKWVPGAVYTFYGMWPIJXIJJLALPQRAYAIXiTEr/AASCG 

TSKELMhilTI G I VIJ^QST I PETILEJ^TDAIAI^MMVLKMVRK^1EKY0IJ^VT 

190 200 210 220 230 240 

120 130 140 150 160 170 

KRYISWCLWWUJYFLTRVEAQIifVWIPPIJ^GGRDAVIU^CAVHPT^^ 

NAVILQNAWKVSCTIIJVVVSVSPIJ^LTSSQQKADWIPLP^ 

250 260 270 280 290 

180 . 190 200 210 220 230 

FG PLWILOASIXKVPYF-VRVQGIIJ^F-CAIjyiKMIGGHYVQMVI IKIX;ALTGT^^ 

• J* • 

KKRSWPLNEAIMAVGMVSILASSLIJCNDIPMTGPLVAGGli-TVCYV-LTGRS^ 
300 3-10 320 330 340 350 

240 250 260 270 280 290 

TPLRDWAHNGLRDLAVAVEPWFSQMETKLITWGADTAACGDIINGLPVSARRGREILLG 



ADVK-WEDQAEISGSSPILSITISE-DGSMSIKNEEEEQTLTILIRTGLLVISG LFP 

360 370 380 390 400 410 

300 310 320 330 340 350 

P ADGMVS KGWRUAP ITA YAQQTRGIXGC I ITSLTGRDKNQVEGEVQ I VS TAAQTFLATC 
• ••*'• • 

VS I P I TAAAWYLWEVKKQRAGVLWDVPS PP PVGKAELe£gaYRI KQKG ^ 
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360 370 380 390 400 410 

INGVCWTVYHGAGTRTIASPKGPVIQMYTNVDQDLV GWPAPQGSRSLTPCTCGSSD 



KEGTFHTMWHVTRGAVI/IHKGKRIEPSWADVKKDLVSCGGGWKLEGEWKEGEEVQVIJVL^ 
480 490 500 510 520 530 
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LYLVTRHADVI PVRRRGDSRGSLLSPRP ISYLKGSSGGPLLCPAGHAVG I FRAAVCTRGV 

PGKNPRAVQTKPGLFKTN—AGTIGAVsiiFSPGTSG^ 
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480 490 500 510 520 530 

AKA'/DFIPVENLETTMRSPVFTDNSSPPWPQSFQVAHLHAPTGSGKS~TK\'PAAYAAQ 



AYVSAIAQTEK—SIEDNPEIEDDIFRK RKLTIIIDLHPGAGKTKRYLPAIVHGAIKR 

600 610 620 630 640 
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HCV GYKVLVLNPS--VAATLGFGAYMSKAHGinPNirTGVKTITTGSPITySTYGKFLADGGC 

• IIS •••••••••*• 

MN-WVDl GLRTLILAPTRVVApi>d:EALRGLPIRyQTPAIRA£HTGREIVDIJl^ 

650 660 670 680 690 700 

590 600 610 620 630 640 

HCV SGGAYDI I ICDECHSTDATS ILG IGTVLDQAETAG ARLWLATATPPGSVTVPHPNIEEV 

X ••••••• •> • • • ••••••• \r 

• ••• >• • •••••• a • , •••• • • • Va 

MNWVDl RVP^nfNLIIMJ)E:AHFTDPASIAARGYISTRVE-MGEAAGIFMTATPPGSRD-PFPOSNAP 
710 720 730 740 750 760 

650 660 670 680 690 700 I 

HC/ ALSTTGEIPFYGKAIPIZVIKGGRHLIFCHSKKKCDEnLAAKLVALGINAVAVnfRGIJDVS 

MNWVDl IMDEEREIPERSWSSGH '^DFKGKTVWFVPSIKAGbmTAACLRKNGKKVTQLSRKTFD 

770 780 790 800 810 820 

710 720 730 740 750 760 

HCV I ? TSGDWWATDALMTG YTGDFDSV I DCNTCVTQTVDFSLEPTFT I ET I TLPQDAVS RT 

MlWVDl SEYVKTRT^^)WNFVVTTDISE^1GANFKAERVIDPRRCMKPVILTIX;EERVIIJVGP 
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MNWVDl SS 
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DISTRIBUTIOH OF RANDOM SAMPLES 
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FIG. 44 

Distribution of P.P. Values for 
gandom Blood Donor Saggles Tested with Two ELISA 

Coof iguratioos 

CI 00-3 Ag ELISA MoAB vs Polyclonal 
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Name 


-Common Sequence 


Variable Sequence 


5'-3-i 
-2 
-3 
-4 

-5 


AAGCTTGATCGAATTC 


CGATCTTGC 
CGATCCTGC 
CGATCATGC 
CGATCGTGC 
CGAAGTTGC 
CGAAGCTGC 


-7 
-8 . 
-9 
-10 

1 i 

-12 




AGATCTTGC 
AGATCCTGC 
AGATCATGC 
AGATCGTGC 
AGAAGTTGC 
AGAAGCTGC 


-13 
-14 
-15 
-15 

■~ 1 / 

■ -18 




CGATCTTGT 
CGATCCTGT 
CGATCATGT 
CGATCGTGT 
CGAAGTTGT 
CGAAGCTGT 


-19 
-20 
-21 
-22 

-24 




AGATCTTGT 
AGATCCTGT 
AGATCATGT 
AGATCGTGT 
AGAAGTTGT 
AGAAGCTGT 


-25 
-26 
-27 
-28 
-29 
-30 




CGCTCTTGC 
CGCTCCTGC 
CGCTCATGC 
CGCTCGTGC 
CGCAGTTGC 
CGCAGCTGC 


-31 
-32 
-33 
-34 
-35 
-36 




CGCTCTTGT 
CGCTCCTGT 
CGCTCATGT 
CGCTCGTGT 
CGCAGTTGT 
CGCAGCTGT 



FIG. 46 "I "-^nsUcion oc D:;a 

GlyCysPrcGLuArgLeuAldSerCysArgProLeuThrAspPheAscGlriCIyTrpCly 
1 CACGCTGTCCTGAGAGGCTAOCCAGCTCCCGACCCCTTACCGATTTTGACCAGGGCTGGG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyL-AlaAsnGiySerGLyProAspGlnAxgProTyrCysTrpHisTyrPro 
61 GCCCTATCAGTTATGCCAACGGAAGCGGCCCCGACCAGCGCCCCTACTGCTGGCACTACC 
CGGGATAGTCAATACGGTTGCCTTCGCCJGGGCTGGTCGCGGGGATGACGACCGTGATGG 

ProLysProCysGlylleValProAlaLysSerValCysGlyProValTyrCysPheThr 
121 CCCCAAAACCTTGCGGTATTGTGCCCGCGAAGAGTGTGTGTGGTCCGGTATATTGCTTCA 
GGGGTTTTGGAACGCCATAACACGGGCGCTTCTCACACACACCAGGCrATATAACGAAGT 

ProSerProValValValGlyThrThrAsDArgSerGlyAlaProThrTyrSerTrpGly 
181 CTCCCAGCCCCGTGGTGGTGGGAACGACCGACAGGTCGGGCGCGCCCACCTACAGCTGGG 
GAGGGTCGGGGCPCCACCACCCTTGCTGGCTGTCCArCCCGCGCGGGTGGATGTCGACCC 

GluAsnAspThrAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTroPhe 
241 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTG 
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

IleGiyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArgLysHisPro 
361 TCATCGGAGGGGCGGGCAACAACACCCTGC ACTGCCCCACTG ATTGCTTCCGCAAGCATC 
AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG 

AspAlaThrTyrSf^rArgCysGlySerGlyProTrpIleThxProArgCysLeuValAsp 
4 21 CGGACGCCACATACTC .CGGTGCGGCTCCGGTCCCTGGATCACACCCAGGTGCCTGGTCG 
GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 



TyrProTyrArgLeuTrpHisTyrProCysThrlleAsnTyrThrllePheLysIleArg 
4 81 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACTATATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACMGGTAGTTGATGTGATATAAATTTTAGT 



MetTyrValGlyGlyValGluHisArgLeuGluAlaAlaCysAsnTrpThrArgGlyGlu 
541 GGATGTACGTGGG AGGGGTCG AGCACAGGCTGGAAGCTGCCTGCAACTGG ACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTCGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 



ArgCysAspLeuGluAspArgAspArgSerGluLeuSerProLeuLeuLeuThrThrThr 
601 AACGTTGCGATCTGGAAGATAGGGACAGGTCCG AGCTCAGCCCGTT ACTGCTG ACCACTA 
TTGCAACGCTAGACCTTGTATCCCTGTCCAGGCTCGAGTGGGGCAATGACGACTGGTGAT 



GInTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeuIle 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTGCCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCa\GGAGGGC^CAAGGAAGTGTTGGGACGGTCGGAACAGGTGGCCGGAGT 

Overlap with Combined ORF of DNAs 12 f through 15e 

HisLeuHisGlnAsnlleValAspValGlnXyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAG AACATTGTGGACGTGCAGTACTTGTACGGGGTGGGGTCAAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 



SerTrpAlalleLys.TrpGluTyrValValLeuLeuPheLeuLeuLeuAlaAspAlaArg 
7 S 1 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTCCTCCTGTTCCTTCTGCTTGCAG ACGCGC 
GCAGGACCCGGTAATTCACCCTCATGCAGCAGGAGGACAAGGAAGACGAACGTCTGCGCG 



VaLCysSerCysLeuTrpMetMecLeuLeuIieSerGlnAlaGiuAlaAlaLeuGluAsn 
841 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAAGCGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTTCGCCGAAACCTCT 



LeuVallleLeuAsoAlaAlaSerLeuAlaGlyThrHisGiyLeuValSerPheLeuVal 
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCGGGACGCACGGTCTTGTATCCTTCCTCG 
TGGAGCATTATGAATTACGTCGxAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 



PhePheCysPheAlaTrpTyrLeuLysGlyLysTrpValProGlyAiaValTyrThrPhe 
961 TGTTCTTCTGCTTTGCATGGTATCTGAAGGGTAAGTGGGTGCCCGG AGCGGTCTACACCT 
ACAAGAAGACGAAACGTACCATAGACTTCCCATTCACCCACGGGCCTCGCCAGATGTGGA 



TyrGlyMetTrpProLeuLeuLeuLeuLeuLeuAlaLeuProGlnArgAlaTyrAlaLeu 
10 21 TCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGAGGAGGACGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG; 



AspThrGluValAlaAlaSerCysGlyGlyValValLeuValGlyLeu^letAlaLeuThr 
1081 TGGACACGGAGGTGGCCGCGTCGTGTGGCGGTGTTGTTCTCGTCGGGTTGATGGCGCTAA 
ACCTG7GCCTCC JCGGCGCAGCACACCGCCACAACAAGAGCAGCCCAACTACCGCGATT 



LeuS er ProTyr TyrLysAr gTyr I leS erTr pCy s LeuTr pTr pLeuGlnTyrPheLeu 
1141 CTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGTGGCTTCAGTATTTTC 
GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 



ThrArgValGluAlaGlnLeuHisValTrpIleProProLeuAsnValArgGlyGlyAxg 
1201 TG ACCAG AGTGGAAGCGCAACTGCACGTGTGGATTCCCCCCCTCAACGTCCGAGGGGGGC 
ACTGGTCTCACCTTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCGG 



AspAlaVallleLeuLeuMetCysAlaValHisProThrLeuValPheAsDileThrLys 
1261 GCGACGCTGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCA 
CGCrGCGACAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT 



. LeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAla 
1321 AATTGCTGCTGGCCGTCTTCGG ACCCCTTTGGATTCTTCAAGCCAG 
TTAACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTC 
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PIQ^ 47-1 ---'-S'-'E^ ^^-^ ^^'^ ^^^'As K'J-I through I5e 



GlyCysProOluArv^LeuAldSerCysArgProLeuThrAspPheAspGLnClyTrpGIy 
1 CAGGCTGTCCTCAGAGGCTAGCCAGCTGCCGACCCCTTACCGATTTTGACCAGGGCTr,GG 
GTCCGACAGGACTCTCCGATCGGTCGACGGCTGGGGAATGGCTAAAACTGGTCCCGACCC 

ProIleSerTyrAlaAsnGlySerGlyProAspGlnAxgProTyrCysTrpHisTyrPro 

61 gccctatcagttatgccaacggaarcggccccgaccagcgcccctactgctggcactacc 
cSggatagtcaatacggttgccttcgccggggctggtcgcggggatgacgaccgtgatgg 

ProLysProCysGlylleValProAlaLysSerValCysGlyProValTyrCysPheThr 

121 ccccaaaaccttgcggtattgtgcccgcgaagagtgtgtgtggtccggtatattgcttca 
ggggttttggaacgccataacacgggcgcttctcacacacaccaggccatataacgaagt 

ProSerProValValValGlyThrThrAspArgSerGlyAlaProThrTvrSerTrpGly 

181 ctcccagccccgtggtggtggg;j\cgaccgacaggtcgggcgcgcccacctacagctggg 
gagggtcgc^ccaccaccacccttgctggctgtvjcagcccgcgcgggtggatgtcgaccc 

GluAsoAspThxAspValPheValLeuAsnAsnThrArgProProLeuGlyAsnTrpPhe 
241 GTGAAAATGATACGGACGTCTTCGTCCTTAACAATACCAGGCCACCGCTGGGCAATTGGT 
CACTTTTACTATGCCTGCAGAAGCAGGAATTGTTATGGTCCGGTGGCGACCCGTTAACCA 

GlyCysThrTrpMetAsnSerThrGlyPheThrLysValCysGlyAlaProProCysVal 
301 TCGGTTGTACCTGGATGAACTCAACTGGATTCACCAAAGTGTGCGGAGCGCCTCCTTGTG 
AGCCAACATGGACCTACTTGAGTTGACCTAAGTGGTTTCACACGCCTCGCGGAGGAACAC 

lieGlyGlyAlaGlyAsnAsnThrLeuHisCysProThrAspCysPheArcLysHisPro 

3 6 1 TCATCGGAGGGGCGGGCAACAACACCCTGCACTGCCCCACTGATTGCTTCCGCAAGCATC' 

AGTAGCCTCCCCGCCCGTTGTTGTGGGACGTGACGGGGTGACTAACGAAGGCGTTCGTAG; 

AspAlaThrT rSerArgCysGlySerGlyProTrpIleThrProArgCysLeuValAsp 

4 2 1 CGGACGCCACATACTCTCGGTGCGGCTCCGGTCCCTGG ATCACACCCAGGTGCCTGGTCG 

GCCTGCGGTGTATGAGAGCCACGCCGAGGCCAGGGACCTAGTGTGGGTCCACGGACCAGC 

TyrProTyrArgLeuTrpHisTyrProCysThxIleAsnTyrThrllePheLysIleArg 
481 ACTACCCGTATAGGCTTTGGCATTATCCTTGTACCATCAACTACACCMATTTAAAATCA 
TGATGGGCATATCCGAAACCGTAATAGGAACATGGTAGTTGATGTGGTATAAATTTTAGT 

MetTyrValGlyGlyValGlaHisArgLeuGluAJ.aAlaCysAsnTrpThrArgGlyGlu 
541 GGATGTACGTGGGAGGGGTCGAACACAGGCTGGAAGCTGCCTGCAACTGGACGCGGGGCG 
CCTACATGCACCCTCCCCAGCTTGTGTCCGACCTTCGACGGACGTTGACCTGCGCCCCGC 

ArgCysAspLeuGluAspArgAspAxgSertluLeuSerProLeuLeuLeuThrThrThr 
601 AACGTTGCGATCTGGAAGACAGGGACAGGTCCGAGCTCAGCCCGTTACTGCTGACCACTA 
TTGCAACGCTAGACCTTCTGTCCCTGTCCAGGCTCGAGTCGGGCAATGACGACTGGTGAT 

GlnTrpGlnValLeuProCysSerPheThrThrLeuProAlaLeuSerThrGlyLeuIle 
661 CACAGTGGCAGGTCCTCCCGTGTTCCTTCACAACCCTACCAGCCTTGTCCACCGGCCTCA 
GTGTCACCGTCCAGGAGGGCACAJ^GGAAGTGTTGGGATGGTCGGAACAGGTGGCCGGAGT 

HisLeuHisGlnAsnlleValAsoValGlnTyrLeuTyrGlyValGlySerSerlleAla 
721 TCCACCTCCACCAG AACATTGTGG ACGTGC AGTACTTGTACGGGGTGGGGTCAAGCATCG 
AGGTGGAGGTGGTCTTGTAACACCTGCACGTCATGAACATGCCCCACCCCAGTTCGTAGC 

SerTrpAlalleLysTrpGluTyrValValLeuLeuPheLeuLeuLeuAlaAspAlaArg 
781 CGTCCTGGGCCATTAAGTGGGAGTACGTCGTTCTCCTGTTCCTTCTGCTTGCAG ACGCGC 
GG^GGACCCGGTAATTCACCCTCATGCAGCAAGAGGACAAGGAASACGAACGTCTGCGCG 

ValCysSexCysLeuTrpMetMetLeuLeuIleSerGlaAlaGluAlaAlaLeuGluAsn 
841 GCGTCTGCTCCTGCTTGTGGATGATGCTACTCATATCCCAAGCGGAGGCGGCTTTGGAGA 
CGCAGACGAGGACGAACACCTACTACGATGAGTATAGGGTTCGCCTCCGCCGAAACCTCT 

LeuVallleLeoAsaAlaAlaSerLeuAlaGlyThrHisGlyLeuVaiSerPheLeuVal 
901 ACCTCGTAATACTTAATGCAGCATCCCTGGCCCGGACGCACGGTCTTGTATCCTTCCTCG 
TGGAGCATTATGAATTACGTCGTAGGGACCGGCCCTGCGTGCCAGAACATAGGAAGGAGC 
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TC I TC . TCTG^TTTGCATGGTATTTG,\,\GGGTA;'.aTGGCTncCCCGAGCGGTCTACACCT 
AC.^AGAAGACGAAACGTACCATAAAv. 1 rCCCATTCACCCACGGGCCTCGCCAGATGTGGA 

TyrGlyMetTrpProLeuLeuLeuLeuLeuLeuAlaLeuProGlnArgAlaT'/rAlaLeu 
TCTACGGGATGTGGCCTCTCCTCCTGCTCCTGTTGGCGTTGCCCCAGCGGGCGTACGCGC 
AGATGCCCTACACCGGAGACGAGG^CGAGGACAACCGCAACGGGGTCGCCCGCATGCGCG 

AspThrGluValAIaAlaSerCysGlyGlyValValLeuValGlyLeuMetAlaLeuThr 
TGGA-CACGGAGGTGGCCGCGTCGTGTC3CGGTGTTGTTCTCGTCGGGTTGATGGCGCTGA 
ACCTGTGCCTCCACCGGCGCAGCACACCGCCACAACA^GAGCAGCCCAACTACCGCGACT 

LeuSerProTyrTyrLysArgTyrlleSerTrpCysLeuTrpTrpLeuGlnTyrPheLeu 
CTCTGTCACCATATTACAAGCGCTATATCAGCTGGTGCTTGTGGTGrCTTCAGTATTTTC 
GAGACAGTGGTATAATGTTCGCGATATAGTCGACCACGAACACCACCGAAGTCATAAAAG 

Thr ArgV alGl uAlaGlnLeuHi s ValTrp I leProProLeuAsnVa lArgG lyGlyArg 
TGACCAGAGTGGAAGCGCAACTGCACGTGTGGATTrcCCCCCTCAACGTCCGAGGGGGGC 
ACTGGTCTCACCrTCGCGTTGACGTGCACACCTAAGGGGGGGAGTTGCAGGCTCCCCCCG 

AspAlaVallleLeuLeuMetCysAlaValHisProThrLeuValPheAspIleThrLys 
GCGACGCCGTCATCTTACTCATGTGTGCTGTACACCCGACTCTGGTATTTGACATCACCA 
CGCTGCGGCAGTAGAATGAGTACACACGACATGTGGGCTGAGACCATAAACTGTAGTGGT ■ 

LeuLeuLeuAlaValPheGlyProLeuTrpIleLeuGlnAlaSerLeuieuLysValPro 
AATTGCTGCTGGCCGTCTTCGGACCCCTTTGGATTCTTCAAGCCAGTTTGCTTAAAGTAC 
TT.SACGACGACCGGCAGAAGCCTGGGGAAACCTAAGAAGTTCGGTCAAACGAATTTCATG 

^TyrPheValArgValGlnGlyLeuLeuArgPheCysAlaLeuAlaArgLysMetlleGly 
CLTACTTTGTGCGCGTCCAAGGCCTTCTCCGGTTCTGCGCGTTAGCGCGGAAGATGATCG 
GGATGAAACACGCGCAGGTTCCGGAAGAGGCCAAGACGCGCAATCGCGCCTTCTACTAGC 

^.Siy"^^'^^^^^-'- '"^etValllelleLysLeuGlyAlaLeuThrGlyThrTyrValTyr 

GAGGCCATTACGTGCAAATGGTCATCATTAAGTTAGGGGCGCTTACTGGCACCTATGTTT 

CTCCGGTAATGCACGTTTACCAGTAGTAATTCAATCCCCGCGAATGACCGTGGATACAAA 

,,^?""^^i®ii^^P^oLeuArgAspTrpAlaHisAsnGlyLeuArgAspLeuAlaValAla 

ATAACCATCTCACTCCTCTTCGGGACTGGGCGCACAACGGCTTGCGAGATCTGGCCGTGG 

TATTGGTAGAGTGAGGAGAAGCCCTGACCCGCGTGTTGCCGAACGCTCTAGACCGGCACC 

ValGluProValValPheSerGlnMetGluThrLysLeuIleThrTrpGlyAlaAspThr 
CTGTAGAGCCAGTCGTCTTCTCCCAAATGGAGACCAAGCTC\TCACGTGGGGGGCAGATA 
GACATCTCGGTCAGCAGAAGAGGGTTTACCTCTGGTTCGAGTAGTGCACCCCCCGTCTAT 




v.v.vav_v.v»v.i» At^AXtJVTCAACGGCTTGCCTGTTTCCGCCCGCAGGGGCCGGGAGA 
GGCGGCGCAC6CCACTGTA6TAGTTGCCGAACGGACAAAGGCGGGCGTCCCCGGCCCTCT 

LeuLeuGlyProAlaAspGlyMetValSerLysGlyTrpArgLeuLeuAlaProIleThr 
TACTGCTCGGGCCAGCCGATGGAATGGTCTCCAAGGGGTGGAGGTTGCTGGCGCCCATCA 
ATGACGAGCCCGGTCGGCTACCTTACCAGAGGrrCCCCACCTCCAACGACCGCGGGTAGT 

1741 CG 
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^.«^;v.«J.Aw^J^-v.^_i^^a^_ft^,AUAAO^a^»otLTC^;TAGGGTGCATAATCACCAGCCTAACTGGCC 
GCCGCATGCGGGTCGTCTGTTCCCCGGAGGATCCCACGTATTAGTGGTCGGATTGACCGG 



AspLysAsnGlnValGluGlyGluValGlnlleValSerThrAlaAlaGlnThrPheLeu 
GGGACAAAAACCAAGTGGAGGGTGAGGTCCAGATTGTGTCAACTCCTGCCCAAACCTTCC 
CCCTGTTTTTGGTTCACCTCCCACTCCAGGTCTAACACAGTTGACGACGGGTTTGGAAGG 

AlaThrCysIleAsnGlyValCysTrpThrValTyrHisGlyAlaGlyThrArgThrlle 
TGGCAACGTGCATCAATGGGGTGTGCTGGACTGTCTACCACGGGGCCGGAACGAGGACCA 
ACCGTTGCACGTAGTTACCCCACACGACCTGACAGATGGTGCCCCGGCCTTGCTCCTGGT 

AlaSerProLysGlyProVallleGlrJietTyrThrAsnValAspGlnAspLeuValGly 
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19 :i TCCCGTCACCC.UCCCT::TuTCATCCACArC7ATACC.\ATGTAGArCAAGACCTT^ 

ACCGCACTGGGTTCCCAGGACAGTACGTCTACATATCGTTACATCrGGTTCTGGAACACC 

TrpProAiaPrcGlnGLySerAxgSerLeuThjrProCysThrCysGIySerSerAspLeu 
198 i GCTGGCCCGCTCCGCAAGGTAGCCGCTCATTGACACCCTGCACTTGCGGCTCCTCGGACC 
CGACCGGGCGAGGCGTTCCMCGGCGAGTAACTGTGGGACGTGAACGCCCAGGAGCCTGG 

TyrLeuValThrAxgHisAlaAsDVallleProValArgArgArgGlyAspSerArgGly 
2041 TTTACCTGGTCACGAGGCACGCCGATGTCATTCCCGTGCGCCGGCGGGGTGATAGCAGGG 
AAATGGACCAGTGCTCCGTGCGGCTACAGTAAGGGCACGCGGCCGCCCCACTATCGTCCC 

* SerLeuLeuSerProArgProIleSerTyrLeuLysGlySerSerGlyGlyProLeiiLeu 
2101 GCAGCCTGCTGTCGCCCCGGCCCATTTCCTACTTGAAAGGCTCCTCGGGGGGTCCGCTGT 
CGTCGGACGACAGCGGGGCCGGGTAAAGGATGAACTTTCCGAGGAGCCCCCCAGGCGACA 

CysProAlaGlyHisAlaValGlyllePheArgAlaAlaValCysThrAxgGlyValAla 
2161 TGTGCCCCGCGGGGCACGCCGTGGGCATATTTAGGGCCGCGGTGTGCACCCGTGGAGTGG 
ACACGGGGCGCCCCGTGCGGCACCCGT.MAAATCCCGGCGCCACACGTGGGCACCTCACC 

LysAlaValAspPhelleProValGluAsnLeuGluThrThrMetAxgSerProValPhe 
2221 CTAAGGCGGTGGACTTTATCCCTGTGGAGAACCTAGAGACAACCATGAGGTCCCCGGTGT 
GATTCCGCCACCTGAAATAGGGACACCTCTTGGATCTCTGTTGGTACTCCAGGGGCCACA 

ThrAspAsnSerSerProProValValProGlnSerPheGlnValAlaHisLeuHisAla 
2281 TCACGGATAACTCCTCTCCACCAGTAGTGCCCCAGAGCTTCCAGGTGGCTCACCTCCATG 
AGTGCCTATTGAGGAGAGGTGGTCATCACGGGGTCTCGAAGGTCCACCGAGTGGAGGTAC 

ProThrGlySerGlyLysSerThrLysValProAlaAlaTyrAlaAlaGlnGlyTyrLys 
2341 CTCCCACAGGCAGCGGCAAAAGCACCAAGGTCCCGGCTGCATATGCAGCTCAGGGCTATA 
GAGGGTGTCCGTCGCCGTTTTCGTGGTTCCAGGGCCGACGTATACGTCGAGTCCCGATAT 

ValLeuValLeuAsnProSerValAlaAlaThrLeuGlyPheGlyAlaTyrMetSerLys 
2401 AGGTGCTAGTACTCAAircCTCTGTTGCTGCAACACTGGGCTTTGGTGCTTACATGTCCA 
tCCACGATCATGAGTTGC;GGAGACAACGACGTTGTGACCCGAAACCACGAATG7ACAGGT 

AlaHisGlylleAspProAsnlleArgThrGlyValArgThrlleThrThrGlySerPro 
24 61 AGGCTCATGGGATCGATCCTAACATCAGGACCGGGGTGAGAACAATTACCACTGGCAGCC 
TCCGAGTACCCTAGCTAGGATTGTAGTCCTGGCCCCACTCTTGTTAATGGTGACCGTCGG 

IleThxTyrSerXhrTyrGlyLysPheLeuAlaAspGlyGlyCysSerGlyGlyAlaTyr 
2521 CCATCACGTACTCCACCTACGGCAAGTTCCTTGCCGAGGGCGGGTGCTCGGGGGGCGCTT 
GGTAGTGCATGAGGTGGATGCCGTTCAAGGAACGGCTGCCGCCCACGAGCCCCCCGCGAA 

AspIlellelleCysAspGluCysHisSerThrAspAlaThrSerlleLeuGlylleGly 
2581 ATG ACATAATAATTTGTGACGAGTGCCACTCCACGG ATGCCACATCCATCTTGGGCATCG 
TACTGTATTATTAAACACTGCTCACGGTGAGGTGCCTACGGTGTAGGTAGAACCCGTAGC 

ThrValLeuAspGlnAlaGluThrAlaGlyAlaArgLeuValValLeuAlaThrAlaThr 
2641 GCACTGTCCTTGACCAAGCAGAGACTGCGGGGGCGAGACTGGTTGTGCTCGCCACCGCCA 
CGTGACAGGAACTGGTTCGTCTCTGACGCCCCCGCTCTGACCAACACGAGCGGTGGCGGT 

ProProGlySerValThrValProHisProAsnlleGluGluValAlaLeuSerThrThr 
2701 CCCCTCCGGGCTCCGTCACTGTGCCCCATCCCAACATCGAGGAGGTTGCTCTGTCCACCA 
GGGGAGGCCCGAGGCAGTGACACGGGGTAGGGTTGTAGCTCCTCCAACGAGACAGGTGGT 

GlyGluIleProPheTyrtlyLysAlalleProLeuGluVallleLysGlyGlyAxgHis 
2761 CCGGAG AGATCCCTTTTTACGGCAAGGCTATCCCCCTCGAAGTAATCAAGGGGGGG/^GAC 
GGCCTCTCTAGGGAAAAATGCCGTTCCGATAGGGGGAGCTTCATTAGTTCCCCCCCTCTG 

LeuIlePheCysHisSerLysLysLysCysAspGluLeuAlaAlaLysLeuValAlaLeu 
2821 ATCTCATCTTCTGTCATTCAAAGAAGAAGTGCGACGAACTCGCCGCAAAGCTGGTCGCAT 
TAGAGTAGAAGACAGTAAGTTTCTTCTTCACGCTGCTTGAGCGGCGTTTCGACCAGCGTA 

GlylleAsnAlaValAlaXyrTyrAraGlyLeuAspVdlSerVallleProThrSerGly 
2881 TGGGCATCAATGCCGTGGCCTACTACCGCGGTCTTGACGTGTCCGTCATCCCGACCAGCG 
ACCCGTAGTTACGGCACCGGATGATGGCGCCAGAACTGCACAGGCAGTAGGGCTGGTCGC 
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As?Val7alVaiV3lAlaThrAspAlaLeu>i«tThrGlvTyrThrGlyAspPheAspSer 
294 1 GCG ATGTTGTCGTCGTGGCAACCGAT -CCCTCATGACCGGCTArACCGGCGACTTCGACT 
CGCTACAACAGCAGCACCGTTGGCTACGGGAGTACTGGCCGATATGGCCGCTGAAGCTCA 

VairleAspCysAsnThrCysValThrGlnThrValAspPheSerLeuAspProThrPhe 
3001 CGGTGATAGACTGCAATACGTGTGTCACCCAGACAGTCGATTTCAGCCTTGACCCTACCT 
GCCACTATCTGACGTTATGCACACAGTGGGTCTGTCAGCTAAAGTCGGAACTGGGATGGA 

XiirlleGluThrlleThrLeuPrcGlnAspAlaValSerArgThrGlnArgArgGlyArg 
3061 TCACCATTGAGACAATCACGCTCCCCCAGG ATGCTGTCTCCCGCACTCAACGTCGGGGCA 
AGTGGTAACTCTGTTAGTGCGAGGGGGTCCTACGACAGAGGGCGTGAGTTGCAGCCCCGT 

ThrClyArgGlyLysProGlylleTyrArgPheValAlaProGl'r^luArgProSerGly 
3121 GGACTGGCAGGGGG AAGCCAGGCATCTACAGATTTGTGGCACCGGGGGAGCGCCCCTCCG 
CCTGACCGTCCCCCTTCGGTCCGTAGATGTCTAAACACCGTGGCCCCCTCGCGGGGAGGC 

MetPheAspSerSerValLeuCysGluCysTyrAspAlaGlyCysAlaTrpTyrGluLeu 
3131 GCATGTTCGACTCGTCCGTCCTCTGTGAGTGCTArGACGCAGGCTGTGCTTGGTATGAGC 
CGTACAAGCTGAGCAGGCAGGAGACACTCACGATACTGCGTCCGACACGAACCATACTCG 

) ThjrProAlaGluThrThrValArgLeuArgAlaTyrMetAsnThrProGlyLeuProVal 
3241 TCACGCCCGCCGAGACTACAGTTAGGCTACGAGCGTACATGAACACCCCGGGGCTTCCCG 
AGTGCGGGCGGCTCTGATGTCAATCCGATGCTCGCATGTACTTGTGGGGCCCCGAAGGGC 

CysGlnAspHisLeuGluPheTrpGluGlyValPheThrGlyLeuThxHisIleAspAla 
3301 TGTGCCAGG ACCAT CTTGAATTTTGGGAGGGCGTCTTTACAGGCCTCACTCATATAGATG 

ACACGGJCCTGGTACAACTTAAAACCCTCCCGCAGAAATGTCCGGAGTGAGTATATCTAC ^ 

HisPheLeuSerGlnThrLysGlnSerGlyGluAsnLeuProTyrLeuValAlaTyrGlri 

33 61 CCCACTTTCTATCCCAGACAAAGCAGAGTGGGGAGAACCTTCCTTACCTGGTAGCGTACC 

GGGTGAAAGATAGGGTCTGTTTCGTCTCACCCCTCTTGGAAGGAATGGACCAXCGCATGG 

AlaThrValCysAlaArgAlaGlnAlaProProProSerTrpAspGlnMetTrpLysCys 

34 21 AAGCCACCGTGTGCGCTAGGGCTCAAGCCCCTCCCCCATCGTGGGACCAGATGTGGAAGT 

TTCGGTGGCACACGCGATCCCGAGTTCGGGGAGGGGGTAGCACCCTGGTCTACACCTTCA 

LeuIleArgLeuLysProThrLeuHisGlyProThrProLeuLeuTyrArgLeuGlyAla 
34 81 GTTTGATTCGCCTCAAGCCCACCCTCCATGGGCCAACACCCCTGCTATACAGACTGGGCG 
CAAACTAAGCGGAGTTCGGGTGGGAGGTACCCGGTTGTGGGGACGATATGTCTGACCCGC 

ValGlriAsnGluIleThrLeuThrHisProValThrLysTyrlleMetThrCysMetSer 
3541 CTGTTC AG AATGAAATCACCCTGACGCACCCAGTCACCAAATACATCATG ACATGCATGT 
GACAAGTCTTACTTTAGTGGGACTGCGTGGGTCAGTGGTTTATGTAGTACTGTACGTACA 

AlaAspLeuGluYalValThrSerThrTrpValLeuValGlyGlyValLeuAlaAlaLeu 
3601 CGGCCG ACCTGGAGGTCGTCACG AGCACCTGGGTGCTCGTTGGCGGCGTCCTGGCTGCTT 
GCCGGCTGGACCTCCAGCAGTGCTCGTGGACCCACGAGCAACCGCCGCAGGACCGACGAA 

AlaAlaTyrCysLeuSerThrGlyCysValVallleValGlyArgValValLeuSerGly 
3661 TGGCCGCGTATTGCCTGTCAACAGGCTGCGTGGTCATAGTGGGCAGGGTCGTCTTGTCCG 
ACCGGCGCATAACGGACAGTTGTCCGACGCACCAGTATCACCCGTCCCAGCAGAACAGGC 

LysProAlallelleProAspArgGluValLeuTyrArgGluPheAspGluMetGluGlu 
3721 GGAAGCCGGCAATCATACCTGACAGGGAAGTCCTCTACCG AGAGTTCG ATG AGATGGAAG 
CCTTCGGCCGTTAGTATGGACTGTCCCTTCAGGAGATGGCTCTCAAGCTACTCTACCTTC 

CysSerGlnHisLeuProTyrlleGluGlnGlyMetLMetLeuAlaGluGlnPheLysGln 
3781 AGTGCTCTCAGCACTTACCGTACATCGAGCAAGGG ATG ATGCTCGCCG AGCAGTTCAAGC 
TCACGAGAGTCGTGAATGGCATGTAGCTCGTTCCCTACTACGAGCGGCTCGTCAAGTTCG 

LysAlaLeuGlyLeuLeuGlnThrAlaSerArgGlnAlaGluVallleAlaProAlaVal 
3841 AGAAGGCCCTCGGCCTCCTGCAGACCGCGTCCCGTCAGGCAGAGGTTATCGCCCCTGCTG 
TCTTCCGGGAGCCGGAGGACGTCTGGCGCAGGGCAGTCCGTCTCCAATAGCGGGGACGAC 

GlnThrAsnTrpGlnLysLeuGluThrPheTrpAlaLysHisMetTroAsnPhelleSer 
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3 90 1 TCCACACC.i-ACTGCC.Su^AAACTCGACACCTTCTCCCCCA.-GCATATGTCCAACTTCATCA 
AboTCTGGTiGACCGTTTTTGAGCTC TGCAAGACCCGCTTCGTATACACCTTGAACTAGT 

GlyrieGlnTyrLeuAliGlyLeuSerThrLeuPrcGlyAsnProAlalleAlaSerLeu 

3 9o i. GTGGGATACAATACTTGGCGGGCTTGTCAACGCTGCCTCGTAACCCCGCCATTGCTTCAT 

C5.CCCTATGTTATGAACCGCCCGAACAGTTGCGACGGACCATTGGGGCGGTAACGAAGTA 

MetAlaPheThrAlaAlaValThrSerProLeuThrThrSerGlnThrLeuLeuPheAsn 
4021 TGATGGCTTTTACAGCTGCTGTCACrAGCCCACTA.ACCACr.'.3CCAAACCCTCCTCTTCA 
ACTACCGAAAATGTCGACGACAGTGGTCGGGTGATrCGTfJATCGGTTTGGGAGGAGAAGT 

IleLeuGlyGlyTrpValAlaAlaGlnLeuAlaAlaProGlyAlaAlaThrAlaPheVal 
4031 ACATATTGGGGGGG TGGGTGGCTGCCCAGCTCGCCGCCCCCGGTGCCGCTACTGCCTTTG 
TGTATAACCCCCCCACCCACCGACGGGTCGAGCGGCGGGGGCCACGGCGATGACGGAAAC 

GlyAlaGlyLeuAlaGlyAlaAlalleGlySerValGlyLeuGlyLysValLeuIleAsp 
4 141 TGGGCGCTGGCTTAGCTGGCGCCGCCATCGGCAGTGTTGGACTGGGGAAGGTCCTCATAG 
ACCCGCGACCr-'J^TCGACCGCGGCGGTAGCCGTCACAACCTGACCCCTTCCAGGAGTATC 

IleLeuAlaGlyTyrGlyAlaGlyValAlaGlyAlaLeuValAlaPheLysIleMetSer 

4 201 ACATCCTTGCAGGGTATGGCGCGGGCGTGGCGGGAGCTCTTGTGGCATTCAAGATCATGA 

TGTAGGAACGTCCCATACCGCGCCCGCACCGCCCTCGAGAACACCGTAAGTTCTAGTACT 

GlyGluValProSerThrGluAspLeuValAsnLeuLeuProAlalleLeuSerProGly 
4 261 GCGGTGAGGTCCCCTCCACGGAGGACCTGGTCAATCTACTGCCCGCCATCCTCTCGCCCG 
CGCCACTCCAGGGGAGGTGCCTCCTGGACCAGTTAGATGACGGGCGGTAGGAGAGCGGGC 

AlaLeuValValGlyValValCysAlaAlalleLeuArgArgHisValGlyPrcsGlyGlu 
4 321 GAGCCCTCGTAGTCGGCGTGGTCTGTGCAGCAATACTGCGCCGGCACGTTGGCCCGGGCG • 
CTCGGGAGCATCAGCCGCACCAGACACGTCGTTATGACGCGGCCGTGCAACCGGGCCCGC = 

^-.a, ^^iyAlaValGL-'-rpMetAsnArgLeuIleAlaPheAlaSerArgGlyAsnHisValSer 
4 3 81 AGGGGGCAGTGCAGTGGATGAACCGGCTGATAGCCTTCGCCTCCCGGGGGAACCATGTTT 
TCCCCCGTCACGTCACCTACTTGGCCGACTATCGGAAGCGGAGGGCCCCCTTGGTACAAA 

ProThrHisTyrValProGluSerAspAlaAlaAlaArgValThrAlalleLeuSerSer 
4 4 1 1 CCCCCACGCACTACGTGCCGGAGAGCGATGCAGCTGCCCGCGTCACTGCCATACTCAGCA 
GGGGGTGCGTGATGCACGGCCTCTCGCTACGTCGACGGGCGCAGTGACGGTATGAGTCGT 

, - - , LeuThrValThrGlnLeuLeuArgArgLeuHisGlnTrpIleSerSerGluCysThrThr 
4501 GCCTCACTGTAACCCAGCTCCTGAGGCG ACTGCACCAGTGGATAAGCTCGGAGTGTACCA 
CGGAGTGACATTGGGTCGAGGACTCCGCTGACGTGGTCACCTATTCGAGCCTCACATGGT 

. r^, ProCysSerGlySerTrpLeuArgAspIleTrpAspTrpIleCysGluValLeuSerAsp 
4561 CTCCATGCTCCGGTTCCTGGCTAAGGGACATCTGGGACTGGATATGCGAGGTGTTGAGCG 
GAGGTACGAGGCCAAGGACCGATTCCCTGTAGACCCTGACCTATACGCTCCACAACTCGC 

PheLysThrTrpLeuLysAlaLysLeuMetProGlnLeuProGlylleProPheValSer 
4621 ACTTTAAG ACCTGGCTAAAAGCTAAGCTCATGCCACAGCTGCCTGGGAICCCCTTTGTGT 
TGAAATTCTGGACCGATTTTCGATTCGAGTACGGTGTCGACGGACCCTAGGGGAAACACA 

-^a, CysGInArgGlyTyrLysGlyValTrpArgValAspGlyrieMetHisThrArgCysHis 
4681 CCTGCCAGCGCGGGTATAAGGGGGTCTGGCGAGTGGACGGCATCATGCACACTCGCTGCC 
GGACGGTCGCGCCCATATTCCCCCAGACCGCTCACCTGCCGTAGTACGTGTGAGCGACGG 

CysGlyAlaGluIleThrGlyHisValLysAsnGlyThrMetArglleValGlyProArg 
4741 ACTGTGGAGCTGAGATCACTGGACATGTCAAAAACGGGACGATGAGGATCGTCGGTCCTA 
TGACACCTCGACTCTAGTGACCTGTACAGTTTTTGCCCTGCTACTCCTAGCAGCCAGGAT 

ThrCy sArgAsnWetTrpSe rCl yThrP hePro n eAsnAlaTyrThrThrG ly ProCys 
48C1 GGACCTGCAGGAACATGTGGAGTGGGACCTTCCCCATTAATGCCTACACCACGGGCCCCT 
CCTGGACGTCCTTGTACACCTCACCCTGGAAGGGGTAATTACGGATGTGGTGCCCGGGGA 

ThrProLeuProAlaProAsnTyrThrPheAlaLeuTrpArgValSerAlaGluGluTyr 
4861 GTACCCCCCTTCCTGCGCCGAACTACACGTTCGCGCTATGGAGGGTGTCTGCAG AGGAAT 
CATGGGGGGAAGGACGCGGCTTGATGTGCAAGCGCGATACCTCCCACAGACGTCTCCTTA 
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J o -M .^^-Ih^ lu- ^e-ArgoinVaiolyAsoPheHis fyrVal-hrGlyMe cThrThrAspAsnLeu 
4 9 - i ATGTGv.AGaTAAG«v.AGGTGGGGGAv.TTCCACTACGTGACGGGTATGACTACTGACAATC 
TACACCTCTATTCCGTCCACCCCCTGAAGGTGATGCACTGCCCATACTGATGACTGTTAG 

LysCysProCysGlnValProSerProGluPhePheThrGluLeuAspGiyValArqLeu 
4 981 TCAAATGCCCGTGCCAGGTCCCATCGCCCGAATTTTTCACAGAATTGGACGGGGTGCGCC 
AGTTTACGGGCACGGTCCAGGGTAGCGGGCTTAAAAAGTGTCTTAACCTGCCCCACGCGG 

cn , , ^,-|^^^^5^^^'^-^*^'"°^^°^y=^>"^'P^°^«"^"^9GIuGluValSerPheArgValGly 
5041 TACATAGGTTTGCGCCCCCCTGCAAGCCCTTGCTGCGGGAGGAGGTATCATTCAGAGTAG 
ATGTATCCAAACGCGGGGGGACGTTCGGGAACGACGCCCTCCTCCATAGTAAGTCTCATC 

c,n, LeuHisGluTyrProValGlySerGlnLeuProCysGluProCluProAspValAlaVal 
5101 GACTCCACGAATACCCGGTAGGGTCGCAATTACCTTGCGAGCCCGAACCGGACGTGGCCG 
CTGAGGTGCTTATGGGCCATCCCAGCGTTAATGGAACGCTCGGGCTTGGCCTGCACCGGC 

LeuThrSer.MetLeuThrAspProSerHisi: aThrAlaGluAlaAlaGlyArgArqLeu 
5 1 o 1 TGTTG ACGTC^TGCTCACTG ATCCCTCCC ATATAACAGC AG AGGCGGCCGGGCG AAGGT 
ACAACTGCAGGTACGAGTGACTAGGGAGGGTATATTGTCGTCTCCGCCGGCCCGCTTCCA 

AlaArgGlySerProProSerValAlaSerSerSerAlaSerGlnLeuSerAlaProSer 
5221 TGGCGAGGGGATCACCCCCCTCTGTGGCCAGCTCCTCGGCTAGCCAGCTATCCGCTCCAT 
ACCGCTCCCCTAGTGGGGGGAGACACCGGTCGAGGAGCCGATCGGTCGATAGGCGAGGTA 

c,oi ' ^,i:*'i^ysAlaThrCysThrAlaAsnHisAspSerProAspAlaGluLeuIleGluAlaAsn 
5281 CTCTCAAGGCAACTTGCACCGCTAACCATG ACTCCCCTGATGCTGAGCTCATAGAGGCCA 
GAGAGTTCCGTTGAACGTGGCGATTGGTACTGAGGGGACTACGACTCGAGTATCTCCGGT. 

s^Ai ^^J:^i^:«3P^gGlnGluMetGlyGlyAsnIleThrArgValGluSerGluAsnLysVal 
5341 ACCTCCTATGGAGGCAGGAGATGGGCGGCAACATCACCAGGGTTGAGTCAGAAAACAAAG 
TGGAGGATACCT'-CGTCCTCTACCCGCCGTTGTAGTGGTCCCAACTCAGTCTTTTGTTTC ' 

--Y!iHi^^!ii^^P^^^^^^^PP-°^®"^*^^*^^"<21"AspGluArgGluIleSerV^ 
5401 TGGTGATTCTGGACTCCTTCGATCCGCTTGTGGCGGAGGAGGACGAGCGGGAGATCTCCG 
ACCACTAAGACCTGAGGAAGCTAGGCGAACACCGCCTCCTCCTGCTCGCCCTCTAGAGGC 

I ^•if5^"^l*^e"ArgLysSerArgArgPheAlaGlnAlaLeuProValTrpAlaArg 
54 61 T^wvwGCAGAAATCCTGCGGAAGTCTCGGAGATTCGCCCAGGCCCTGCCCGTTTGGGCGC 
ATGGGCGTCTTTAGGACGCCTTCAGAGCCTCTAAGCGGGTCCGGGACGGGCAAACCCGCG 



tggtccatggctgtccgcttccacctccaaagtcccctcctgtgcc: 
accaggtaccgacaggcgaaggtggaggtttcaggggaggacacgg; 

ArgThrValValLeuThrGluSerThrLeuSerThrAlaLeuAlaGluLeuAla'^hrArg 
5641 AGCGGACGGTGGTCCTCACTGAATCAACCCTATCTACTGCCTTGGCCGAGCTCGCCACCA 
TCGCCTGCCACCAGGAGTGACTTAGTTGGGATAGATGACGGAACCGGCTCGAGCGGTGGT 

SerPheGlySerSerSerThrSerGlylleThrGlyAspAsnThrThrThrSerSerGlu 
5701 GAAGCTTTGGCAGCTCCTCAACTTCCGGCATTACGGGCGACAATACGACAACATCCTCTG 
CTTCGAAACCGTCGAGGAGTTGAAGGCCGTAATGCCCGCTGTTATGCTGTTGTAGGAGAC 

ProAlaProSerGlyCysProProAspSerAspAlaGluSerTyrSerSerMetProPro 
5761 AGCCCGCCCCTTCTGGCTGCCCCCCCGACTCCGACGCTGAGTCCTATTCCTCCATGCCCC 
TCGGGCGGGGAAGACCGACGGGGGGGCTGAGGCTGCGACTCAGGATAAGGAGGTACGGGG 

LeuGluGlyGl-jProGlyAspProAspLeuSerAspGlySerTrpSerThrValSerSer 
53^1 CCCTGGAGGGGGAGCCTGGGGATCCGGATCTTAGCGACGGGTCATGGTCAACGGTCAGTA 
GGGACCTCCCCCTCGGACCCCTAGGCCTAGAATCGCTGCCCAGTACCAGTTGCCAGTCAT 

GluAlaAsnAlaGluAspValValCysCysSerMecSerTyrSerTrpThrGlyAlaO^eu 

Fin 47- 



5 381 JTJACCCCAACGCGGACCA:CTCCTCTGCTr:CTC.^^:0TCTTACTCTTGGA'^AO;3CGCAC 
CACTCCGGTTGCGCCTCCTACACrACACGACGAGTTACAGAATGAGAACCTGTCCGCGTG 

ValThrProCysAldAlaGluGiuGlnL/sLeuProIleAsnAlaLeuSerAsnSerLeu 

5 341 TCGTCACCCCGTGCGCCGCGGAAGAACAGAAACTGCCCATCAATGCACTAw^iGCAACTCGT 

AGCAGTGGGGCACGCGGCG JCTTCTTGTCTTTG ACGGGTAGTTACG TGATTCGTTG AGCA 

LeuArgHisHisAsnLeuValTyrSerThrThrSerAxgSerAlaCysGlnArgGlnLys 
6001 TGCTACGTCACCACAATTTGGTG'^ATTCCACCACCTCACGCAGTGCTTGCCAAAGGCAGA 
ACGATGCAGTGGTGTTAAACCACATAAGGTGGTGGAGTGCGTCACGAACGGTTTCCGTCT 

LysValThxPheAspArgLeuGlnValLeuAspSerHisTyrGlnAspValLeuLysGlu 
'6061 AGAAAGTCACATTTGACAGACTGCAAGTTCTGGACAGCCATTACCAGGACGTACTCAAGG 
TCTTTCAGTGTAAACTGTCTGACGTTCAAGACCTGTCGGTAATvjGTCCTGCATGAGTTCC 

ValLysAlaAlaAlaSerLysValLysAlaAsnLeuLeuSerVa IGluGluAlaCysSer 
6121 AGGTTAAAGCAGCGGCGTCAA.\AGTGAAGGCTAACTTGCTATCCGTAGAGG AAGCTTGCA 
TCCAATTTrGTCGCCGCAGTTTTCACTTCCGArTGAACGATAGGCATCTCCTTCGAACGT 

LeuThrProProHisSerAlaLysSerLysPheGlyTyrGlyAlaLysAspValArgCys 
6131 GCCTGACGCCCCCACACTCAGCCAAATCCAAGTTTGGTTATGGGGCAAAAGACGTCCGTT 
CGGACToCGGGGGTGTGAGTCGGTTTAGGTTCAAACCAATACCCCGTTTTCTGCAGGCAA 

HisAlaArgLysAlaValThrHisIleAsnSerValTrpLysAspLeuLeuGluAspAsn 

6 241 GCCATGCCAGAAAGGCCGTAACCCACATCAACTCCGTGTGGAAAGACCTTCTGGAAGACA 

CGGTACGGTCTTTCCGGCATTGGGTGTAGTTGAGGCACACCTTTCTGGAAGACCTTCTGT 

ValThrProIleAspThrThrlleMetAlaLysAsnGluValPheCysValGlnProGlu 
6301 ATGTAACACCAATAG ACACTACCATCATGGCTAAGAACG AGGTTTTCTGCGTTCAGCCTG 
TACATTGTGGTTATCTGTGATGGTAGTACCGATTCTTGCTCCAAAAGACGCAAGTCGGAC 

LysGlyGl/'^rgLysProAlaArgLeuIleValPheProAspLeuGlyValArgValCys 
6361 AGAAGGGGGGTCGTAAGCCAGCTCGTCTCATCGTGTTCCCCGATCTGGGCGTGCGCGTGT 
TCTTCCCCCCAGCATTCGGTCGAGCAGAGTAGCACAAGGGGCTAGACCCGCACGCGCACA 

GluLysMetAlaLeuTyrAspValValThrLysLeuProLeuAlaVaLMetGlySerSer 
64 21 GCGAAAAGATGGCTTTGTACGACGTGGTTACAAAGCTCCCCTTGGCCGTGATGGGAAGCT 
CGCTTTTCTACCGAAACATGCTGCACCAATGTTTCGAGGGGAACCGGCACTACCCTTCGA 

TyrGlyPheGlnTyrSerProGlyGlnArgValGluPheLeuValGlnAlaTrpLysSer 
6481 CCTACGGATTCCAATACTCACCAGGACAGCGGGTTGAATTCCTCGTGCAAGCGTGGAAGT 
GGATGCCTAAGGTTATGAGTGGTCCTGTCGCCCAACTTAAGGAGCACGTTCGCACCTTCA 

LysLysThrProMetGlyPheSerTyrAspThrArgCysPheAspSerThrValThrGlu 
6541 CCAAGAAAACCCCAATGGGGTTCTCGTATGATACCCGCTGCTTTGACTCCACAGTCACTG 
GGTTCTTTTGGGGTTACCCCAAGAGCATACTATGGGCGACGAAACTGAGGTGTCAGTGAC 

SerAspIleArgThrGluGluAlalleTyrGlnCysCysAspLeuAspProGlnAlaArg 
6601 AGAGCG ACATCCGTACGGAGGAGGCAATCTACCAATGTTGTG ACCTCGACCCCCAAGCCC 
TCTCGCTGTAGGCATGCCTCCTCCGTTAGATGGTTACAACACTGGAGCTGGGGGTTCGGG 

ValAlalleLysSerLeuThrGluArgLeuTyrValGlyGlyProLeuThrAsnSerArg 
6661 GCGTGGCCATCAAGTCCCTCACCG AG AGGCTTTATGTTGGGGGCCCTCTTACCAATTCAA 
CGCACCGGTAGTTCAGGGAGTGGCTCTCCGAAATACAACCCCCGGGAGAATGGTTAAGTT 

GlyGluAsnCysGlyTyrArgArgCysArgAlaSerGlyValLeuThrThrSerCysGly 
6721 GGGGGGAGAACTGCGGCTATCGCAGGTGCCGCGCGAGCGGCGTACTGACAACTAGCTGTG 
CCCCCCTCTTGACGCCGATAGCGTCCACGGCGCGCTCGCCGCATGACTGTTGATCGACAC 

AsnThrLeuThrCysTyrlleLysAlaArgAlaAlaCysArgAlaAlaGlyLeuGlaAsp 
6781 GTAACACCCTCACTTGGTACATCAAGGCCCGGGCAGCCTGTCGAGCCGCAGGGCTCCAGG 
CATTGTGGGAGTGAACGATGTAGTTCCGGGCCCGTCGGACAGCTCGGCGTCCCGAGGTCC 

CysThrMetLeuValCysGlyAspAspLeuValVairieCysGluSerAlaGlyValGla 
6841 ACTGCACCATGCTCGTGTGTGGCG ACG ACTTAGTCGTTATCTGTG AAAGCGCGGGGGTCC 
TGACGTGGTACGAGCACACACCGCTGCTGAATCAGCAATAGACACTTTCGCGCCCCCAGG 
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GluAspAlaAxaSerLeuArgAlaPheThrGluAlaMetThrArgTyrSerAlaProPro 
6901 AGGAGG ACGCGGCG AGCCTG AG AGCCTTCACGGAGGCTATGACCAGGTACTCCGCCCCCC 
TCCTCCTGCGCCGCTCGGACTCTCGGAAGTGCCTCCGATACTGGTCCATGAGGCGGGGGG 

GlyAspProProGlnProGluTyrAspLeuGluLeuIleThrSerCysSerSerAsnVal 
5961 CTGGGGACCCCCCACAACCAGAATACGACTTGG AGCTCATAACATCATGCTCCTCCAACG 
GACCCCTGGGGGGTGTTGGTCTTATGCTGAACCTCGAGTATTGTAGTACGAGGAGGTTGC 

SerValAlaHisAspGlyAlaGlyLysArgValTTrTyrLeuThrArgAspProThrThr 
7021 TGTCAGTCGCCC2.CG ACGGCGCTGGAAAG AGGGTCTACTACCTCACCCGTGACCCTAC^ 
ACAGTCAGCGGGTGCTGCCGCGACCTTTCTCCCAGATGATGGAGTGGGCACTGGGATGTT : 

ProLeuAlaArgAlaAlaTrpGluThrAlaArgHisThrProValAsnSerTrpLeuGly 
7081 CCCCCCTCGCGAGACTTGCGTGGGAGACAGCAAGACACACTCCAGTCAATTCCTGGCTAG ^ 
GGGGGGAGCGCTCTCoaCGCACCCTCTGTCGTTCTGTGTGAGGTCAGTTAAGGACCGATC 

AsnllelleMetPheAlaProThrLcuTrpAlaArgMetlleLeuMetThrHisPhePhe 
7141 GCAACATAATCATGTTTGCCCCCAC ACTGTGGGCG AGGATGATACTG ATG ACCCATTTCT 

cgttgtattagtacaaacgggggtgtgacacccgctcctactatgactactgggtaaaga 

SerValLeuIleAlaArgAspGlnLeuGluGlnAlaLeuAspCysGluIleTyrGlyAla 
7201 TTAGCGTCCTTATAGCCAGGGACCAGCTTG AACAGGCCCTCGATTGCGAGATCTACGGGG 
AATCGCAGGAATATCGGTCCCTGGTCGAACTTGTCCGGGAGCTAACGCTCTAGATGCCCC 

CysTyrSerlleGluProLeuAspLeuProProIlelleGlnArgLeu 
7261 CCTGCTACTCCATAG AACCACTTG ATCTACCTCCAATCATTCAAAG ACTC 
GGACGATGAGGTATCTTGGTGAACTAGATGGAGGTTAGTAAGTTTCTGAG 
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